, Bibliographie personnelle

N. Lossing, Analyse sémantique des tableaux, des structures et des pointeurs. Rapp. tech, 2013.

N. Lossing, PIPS : Tutorial to use PIPS with Eclipse. Rapp. tech, 2014.

N. Lossing, C. Ancourt, and F. Irigoin, Automatic Code Generation of Distributed Parallel Tasks, 2016 IEEE Intl Conference on Computational Science and Engineering (CSE) and IEEE Intl Conference on Embedded and Ubiquitous Computing (EUC) and 15th Intl Symposium on Distributed Computing and Applications for Business Engineering (DCABES), pp.234-241
DOI : 10.1109/CSE-EUC-DCABES.2016.190

URL : https://hal.archives-ouvertes.fr/hal-01359468

N. Lossing, P. Guillou, M. Amini, and F. Irigoin, From Data to Effects Dependence Graphs : Source-to-Source Transformations for C, 18th International Workshop on Compilers for Parallel Computing, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01254426

N. Lossing, P. Guillou, and F. Irigoin, Effects Dependence Graph: A Key Data Concept for C Source-to-Source Compilers, 2016 IEEE 16th International Working Conference on Source Code Analysis and Manipulation (SCAM), pp.2016-167, 2016.
DOI : 10.1109/SCAM.2016.20

URL : https://hal.archives-ouvertes.fr/hal-01359465

A. Acharya and U. Bondhugula, PLUTO+ : Near-complete Modeling of Affine Transformations for Parallelism and Locality, Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. PPoPP 2015, pp.54-64, 2015.

L. Adhianto, G. Jin, M. Krentel, J. Mellor-crummey, W. N. Scherer et al., Coarray Fortran 2.0 Prealpha Release Notes. Rapp. tech. Rice University, 2012.

G. Vikram-adve, J. Jin, Q. Mellor-crummey, and . Yi, High Performance Fortran Compilation Techniques for Parallelizing Scientific Codes, Proceedings of the 1998 ACM/IEEE Conference on Supercomputing . SC '98, pp.1-23, 1998.

A. V. Aho, R. Sethi, J. Ullman, and M. S. Lam, Compilers : principles, techniques, & tools. 2nd, 2006.

A. , url : https, p.2, 2000.

C. Ancourt, F. Coelho, F. Irigoin, and R. Keryell, A Linear Algebra Framework for Static High Performance Fortran Code Distribution, Scientific Programming, vol.6, issue.1, pp.3-27, 1997.
DOI : 10.1155/1997/195689

C. Ancourt, T. Viet, and N. Nguyen, Array resizing for scientific code debugging, maintenance and reuse, Proceedings of the 2001 ACM SIGPLAN-SIGSOFT workshop on Program analysis for software tools and engineering , PASTE '01, pp.32-37, 2001.
DOI : 10.1145/379605.379656

J. W. Backus, The Syntax and Semantics of the Proposed International Algebraic Language of the Zurich ACM-GAMM Conference, Proceedings of the International Conference on Information Processing, pp.125-131, 1960.

D. Bartz, B. Schneider, and C. Silva, Rendering and Visualization in Parallel Environments, ACM SIGGRAPH 2000 Courses. SIGGRAPH '00, 2000.

G. Bell and J. Gray, What's next in high-performance computing?, Communications of the ACM, vol.45, issue.2, pp.91-95, 2002.
DOI : 10.1145/503124.503129

G. Somashekaracharya, U. Bhaskaracharya, A. Bondhugula, and . Cohen, Automatic Storage Optimization for Arrays, In : ACM Trans. Program. Lang. Syst, vol.38, issue.11, pp.1-1123, 2016.

L. S. Blackford, J. Demmel, J. Dongarra, I. Duff, S. Hammarling et al., An updated set of basic linear algebra subprograms (BLAS), ACM Transactions on Mathematical Software, vol.28, issue.2, pp.135-151, 2002.
DOI : 10.1145/567806.567807

URL : http://icl.cs.utk.edu/news_pub/submissions/blast-toms.pdf

, Basic Linear Algebra Subprograms. url : http://www.netlib. org/blas, BLAS, p.2, 2017.

F. Bodin, L. Kervella, and T. Priol, Fortran-S, Proceedings of the 1993 ACM/IEEE conference on Supercomputing , Supercomputing '93, pp.274-283, 1993.
DOI : 10.1145/169627.169732

U. Bondhugula, Compiling affine loop nests for distributed-memory parallel architectures, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis on, SC '13, pp.1-3312, 2013.
DOI : 10.1145/2503210.2503289

U. Bondhugula, A. Acharya, and A. Cohen, The Pluto+ Algorithm, ACM Transactions on Programming Languages and Systems, vol.38, issue.3, 2016.
DOI : 10.1109/IPDPS.2000.845979

URL : https://hal.archives-ouvertes.fr/hal-01425546

U. Bondhugula, V. Bandishti, A. Cohen, G. Potron, and N. Vasilache, Tiling and optimizing time-iterated computations on periodic domains, Proceedings of the 23rd international conference on Parallel architectures and compilation, PACT '14, pp.39-50, 2014.
DOI : 10.1007/BF02106826

URL : https://hal.archives-ouvertes.fr/hal-01257240

U. Bondhugula, V. Bandishti, and I. Pananilath, Diamond Tiling: Tiling Techniques to Maximize Parallelism for Stencil Computations, IEEE Transactions on Parallel and Distributed Systems, vol.28, issue.5, 2016.
DOI : 10.1109/TPDS.2016.2615094

, BSC Programming Models group. The OmpSs Programming Model. url : https://pm.bsc.es/ompss, p.2, 2017.

B. Chapman, P. Mehrotra, and H. Zima, Programming in Vienna Fortran, Scientific Programming, vol.1, issue.1, pp.31-50, 1992.
DOI : 10.1155/1992/258136

C. Coarfa, Y. Dotsenko, J. Mellor-crummey, F. Cantonnet, T. El-ghazawi et al., An evaluation of global address space languages, Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of parallel programming , PPoPP '05, pp.36-47, 2005.
DOI : 10.1145/1065944.1065950

F. Coelho, Contributions à la compilation du High Performance Fortran, Theses. Ecole Nationale Supérieure des Mines de Paris, 1996.

B. Creusillet, Analyse de régions de tableaux et applications, Theses. Ecole Nationale Supérieure des Mines de Paris, 1996.

R. Dathathri, C. Reddy, T. Ramashekar, and U. Bondhugula, Generating Efficient Data Movement Code for Heterogeneous Architectures with Distributed-memory, Proceedings of the 22Nd International Conference on Parallel Architectures and Compilation Techniques. PACT '13, pp.375-386, 2013.

J. J. Dongarra, J. D. Croz, S. Hammarling, and I. S. Duff, A set of level 3 basic linear algebra subprograms, ACM Transactions on Mathematical Software, vol.16, issue.1, pp.1-17, 1990.
DOI : 10.1145/77626.79170

J. Dongarra, T. Sterling, H. Simon, and E. Strohmaier, High-Performance Computing: Clusters, Constellations, MPPs, and Future Directions, Computing in Science and Engineering, vol.7, issue.2, pp.51-59, 2005.
DOI : 10.1109/MCSE.2005.34

J. J. Dongarra, J. D. Croz, S. Hammarling, and R. J. Hanson, An extended set of FORTRAN basic linear algebra subprograms, ACM Transactions on Mathematical Software, vol.14, issue.1, pp.1-17, 1988.
DOI : 10.1145/42288.42291

P. Feautrier, Parametric integer programming " . eng, Recherche Opérationnelle, vol.223, pp.243-268, 1988.

J. A. Fisher, Trace Scheduling: A Technique for Global Microcode Compaction, IEEE Transactions on Computers, vol.30, issue.7, pp.478-490, 1981.
DOI : 10.1109/TC.1981.1675827

URL : http://www.ece.cmu.edu/~ece447/s13/lib/exe/fetch.php?media=01675827.pdf

J. A. Fisher, P. Faraboschi, and C. Young, Embedded Computing : A VLIW Approach to Architecture, Compilers and Tools, 2005.

J. Michael and . Flynn, Some Computer Organizations and Their Effectiveness, IEEE Trans. Comput, vol.21, issue.9, pp.948-960

T. Gautier, J. V. Lima, N. Maillard, and B. Raffin, XKaapi: A Runtime System for Data-Flow Task Programming on Heterogeneous Architectures, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing, pp.1299-1308, 2013.
DOI : 10.1109/IPDPS.2013.66

URL : https://hal.archives-ouvertes.fr/hal-00799904

G. , Auto-vectorization in GCC. url : http://gcc.gnu.org/projects/ tree-ssa/vectorization.html (visité le 18, 2016.

A. Geist, A. Beguelin, J. Dongarra, and W. Jiang, Robert Manchek et Vaidy Sunderam PVM?parallel virtual machine : a users' guide and tutorial for networked parallel computing. Scientific and engineering computation, 1994.

D. Andrew, T. Gordon, . Melham-van-leeuwen, J. Joakim-von-wright, J. Grundy et al., Five axioms of alpha-conversion, Theorem Proving in Higher Order Logics : 9th International Conference Proceedings. Sous la dir. de Gerhard Goos, pp.173-190, 1996.

J. C. Michael and . Gordon, The Denotational Description of Programming Languages : An Introduction, 1979.

, RÉFÉRENCES

R. Habel, High Performance Programming for Hybrid Architectures, Theses. Ecole Nationale Supérieure des Mines de, 2014.
URL : https://hal.archives-ouvertes.fr/tel-01101782

R. Habel, F. Silber-chaussumier, F. Irigoin, E. Brunet, and F. Trahay, Combining Data and Computation Distribution Directives for Hybrid Parallel Programming : A Transformation System, International Journal of Parallel Programming, vol.26, issue.6, pp.1268-1295
DOI : 10.1145/113446.113449

C. Harris and M. Stephens, Proc. of Fourth Alvey Vision Conference, pp.147-151, 1988.

L. John, D. A. Hennessy, and . Patterson, Computer Architecture : A Quantitative Approach. 5th, 2012.

M. D. Jonathan, . Hill, D. C. Mccoll, M. W. Stefanescu, K. Goudreau et al., BSPlib : The BSP programming library, Parallel Computing, vol.24, pp.14-1947, 1998.

S. Hiranandani, K. Kennedy, and C. Tseng, Compiling Fortran D for MIMD distributed-memory machines, Communications of the ACM, vol.35, issue.8, pp.66-80, 1992.
DOI : 10.1145/135226.135230

H. High-performance-fortan, url : http://hpff.rice, p.2, 2017.

. Intel, Getting Started with Intel R Cilk TM Plus Array Notations. url : https : / / software . intel . com / sites / default / files / article / 185163/introduction-to-array-notation.pdf, 2017.

, Basic Architecture url : https://www-ssl. intel.com/content/dam, Intel. Intel R 64 and IA-32 Architectures Software Developer's Manual, p.2, 2016.

, url : https : / / software . intel . com / sites, Intel. Intel R Architecture Instruction Set Extensions Programming Reference . Fév, pp.319433-319457, 2016.

F. Irigoin and R. Triolet, Supernode partitioning, Proceedings of the 15th ACM SIGPLAN-SIGACT symposium on Principles of programming languages , POPL '88, pp.319-329, 1988.
DOI : 10.1145/73560.73588

, (E) Information technology ? Portable Operating System Interface (POSIX R ) Base Specifications, Issue 7. Standard . International Organization for Standardization, sept, p.2009, 2009.

K. Kennedy and R. Allen, Optimizing Compilers for Modern Architectures : A Dependence-based Approach, 2001.

K. Kennedy, C. Koelbel, and H. Zima, The rise and fall of High Performance Fortran, Proceedings of the third ACM SIGPLAN conference on History of programming languages , HOPL III, pp.7-8, 2007.
DOI : 10.1145/1238844.1238851

D. Khaldi, Automatic Resource-Constrained Static Task Parallelization : A Generic Approach, Theses. Ecole Nationale Supérieure des Mines de, 2013.
URL : https://hal.archives-ouvertes.fr/pastel-00935483

D. Khaldi, P. Jouvelot, and C. Ancourt, Parallelizing with BDSC, a resource-constrained scheduling algorithm for shared and distributed memory systems, Parallel Computing, vol.41, pp.66-89, 2015.
DOI : 10.1016/j.parco.2014.11.004

URL : https://hal.archives-ouvertes.fr/hal-01097328

K. Group and . Khronos, url : https://www.khronos.org, 2017.

K. Group and . Opencl, The open standard for parallel programming of heterogeneous systems. url : https://www.khronos.org/opencl, p.2, 2017.

K. Group, The OpenCL Specification, Version : 2.2 url : https, 2016.

M. Klemm and C. Terboven, Full Throttle : OpenMP 4.0 " . In : The Parallel Universe Magazine, pp.6-16, 2013.

C. H. Koelbel, The High performance Fortran handbook. Scientific and engineering computation, 1994.

S. Larsen and S. Amarasinghe, Exploiting Superword Level Parallelism with Multimedia Instruction Sets, Proceedings of the ACM SIGPLAN 2000 Conference on Programming Language Design and Implementation. PLDI '00, pp.145-156, 2000.

C. L. Lawson, R. J. Hanson, D. R. Kincaid, and F. T. Krogh, Basic Linear Algebra Subprograms for Fortran Usage, ACM Transactions on Mathematical Software, vol.5, issue.3, pp.308-323
DOI : 10.1145/355841.355847

G. Malewicz, M. H. Austern, J. Aart, J. C. Bik, I. Dehnert et al., Pregel : A System for Large-scale Graph Processing, Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data. SIGMOD '10, pp.135-146, 2010.

F. William and . Mccoll, Foundations of Time-Critical Scalable Computing In : Fundamentals -Foundations of Computer Science, IFIP World Computer Congress, pp.93-107, 1998.

J. Mellor-crummey, L. Adhianto, W. N. Scherer, I. , and G. Jin, A New Vision for Coarray Fortran STEP : A Distributed OpenMP for Coarse-grain Parallelism Tool, Proceedings of the Third Conference on Partitioned Global Address Space Programing Models. PGAS '09 Proceedings of the 4th International Conference on OpenMP in a New Era of Parallelism. IWOMP'08. West, pp.1-5, 2008.

D. Millot and A. Muller, Christian Parrot et Frédérique Silber- Chaussumier From OpenMP to MPI : first experiments of the STEP source-to-source transformation tool In : Parallel Computing : From Multicores and GPU's to Petascale, Proceedings of the conference ParCo, pp.669-676, 2009.

M. Mittal, A. Peleg, and U. Weiser, MMX TM Technology Architecture Overview, In : Intel Technology Journal, vol.1, issue.3, 1997.

M. , Message Passing Interface. url : https://www.mpi-forum.org, p.2, 2017.

. Mpich, url : https://www.mpich.org, 2017.

H. B. Robert, . Netzer, P. Barton, and . Miller, What Are Race Conditions ? : Some Issues and Formalizations, In : ACM Lett. Program. Lang. Syst, vol.1, issue.1, pp.74-88, 1992.

. John-von-neumann, First draft of a report on the EDVAC, IEEE Annals of the History of Computing, vol.15, issue.4, pp.27-75, 1993.
DOI : 10.1109/85.238389

W. Robert, J. Numrich, and . Reid, Co-array Fortran for Parallel Programming, In : SIGPLAN Fortran Forum, vol.17, issue.2, pp.1-31, 1998.

W. Robert, J. Numrich, and . Reid, Co-arrays in the Next Fortran Standard, In : SIGPLAN Fortran Forum, vol.24, issue.2, pp.4-17, 2005.

N. Cuda, url : https://developer.nvidia.com/cuda-zone (visité le 02, p.2, 2017.

N. Toolkit, , p.2, 2017.

N. , C. Library, and U. Guide, , p.2, 2016.

N. and C. Library, , p.2, 2016.

M. Open, url : https://www.open-mpi, 2017.

. Openacc, , 2017.

. Openacc, The OpenACC R Application Programming Interface, Version 2.5, 2015.

. Openmp, , 2017.

A. Openmp, OpenMP Application Programming Interface, Version 4.5, 2015.

, Performance Application Programming Interface. url : http:// icl.utk, p.2, 2017.

K. Pearson, Note on Regression and Inheritance in the Case of Two Parents, Proceedings of the Royal Society of London (1854-1905), vol.58, issue.-1, pp.347-352, 1895.
DOI : 10.1098/rspl.1895.0041

P. , , 2017.

. Pluto, An automatic parallelizer and locality optimizer for affine loop nests. url : http : / / pluto -compiler . sourceforge . net, p.2, 2017.

. Polybench, url : https://sourceforge, 2017.

J. Regehr, Race Condition vs. Data Race url : http: //blog.regehr.org/archives, p.2, 2011.

G. Requilé and . Pvm, Parallel Virtual Machine Les aspects communication, p.2, 1995.

S. Michael, B. R. Schlansker, and . Rau, EPIC : Explicitly Parallel Instruction Computing, In : Computer, vol.33, issue.2, pp.37-45, 2000.

T. Sterling, D. J. Becker, D. Savarese, J. E. Dorband, A. Udaya et al., Beowulf : A Parallel Workstation For Scientific Computation, Proceedings of the 24th International Conference on Parallel Processing, pp.11-14, 1995.

V. S. Sunderam, PVM: A framework for parallel distributed computing, Concurrency : Pract. Exper. 2.4 (nov, pp.315-339, 1990.
DOI : 10.1002/cpe.4330020404

H. Sutter, The Free Lunch Is Over : A Fundamental Turn Toward Concurrency in Software, In : Dr. Dobb's Journal, vol.303, pp.202-210, 2005.

. Shreekant, . Ticky, T. Thakkar, and . Huff, The Internet Streaming SIMD Extensions, In : Intel Technology Journal, vol.3, issue.2, 1999.

, Thinking Machines Corporation. CM Fortran Reference Manual, Version 1.0, 1991.

M. Tillenius, E. Larsson, R. M. Badia, and X. Martorell, Resource-Aware Task Scheduling, ACM Transactions on Embedded Computing Systems, vol.14, issue.1, pp.1-525, 2015.
DOI : 10.1145/2379776.2379780

A. Tiskin, BSP (Bulk Synchronous Parallelism) . In : Encyclopedia of Parallel Computing, pp.192-199, 2011.

R. Triolet, Contribution a la parallélisation automatique de programmes Fortran comportant des appels de procédure, Theses. Ecole Nationale Supérieure des Mines de Paris, 1984.

C. Tseng, An optimizing Fortran D compiler for MIMD distributedmemory machines " . Theses. Rice University, 1993.

U. Consortium, UPC Specifications, Version 1.3, 2013.

G. Leslie and . Valiant, A Bridging Model for Parallel Computation, Commun. ACM, vol.338, pp.103-111, 1990.

E. Michael, M. S. Wolf, and . Lam, A Data Locality Optimizing Algorithm, Proceedings of the ACM SIGPLAN 1991 Conference on Programming Language Design and Implementation. PLDI '91, pp.30-44, 1991.

M. Wolfe, More iteration space tiling, Proceedings of the 1989 ACM/IEEE conference on Supercomputing , Supercomputing '89, pp.655-664, 1989.
DOI : 10.1145/76263.76337

M. Wolfe, High Performance Compilers for Parallel Computing . 1st, 1996.

H. P. Zima and B. M. Chapman, Compiling for distributed-memory systems, Proceedings of the IEEE 81.2 (fév, pp.264-287, 1993.
DOI : 10.1109/5.214550