M. Abadi, Large-Scale Machine Learning on Heterogeneous Distributed Systems

A. Agarwal, An Introduction to Computational Networks and the Computational Network Toolkit

Altera, Altera SDK for OpenCL. url: https://www.altera.com/products/design-software/embedded-software-developers

M. Amini, Source-to-Source Automatic Program Transformations for GPU-like Hardware Accelerators. PhD thesis, MINES ParisTech, Dec. 2012.

A. Technology, url: https://www.arm.com/products/processors/technologies/biglittleprocessing

P. Aubry, Extended Cyclostatic Dataflow Program Compilation and Execution for an Integrated Manycore Processor, Procedia Computer Science, vol.18, pp.1624-1633, 2013.
DOI : 10.1016/j.procs.2013.05.330

URL : https://hal.archives-ouvertes.fr/hal-00832504

F. Bastien, Theano: new features and speed improvements. Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop, 2012.

S. Behnel, Cython: The Best of Both Worlds, Computing in Science & Engineering, vol.13, issue.2, pp.31-39, 2011.
DOI : 10.1109/MCSE.2010.118

J. Bergstra, Theano: a CPU and GPU Math Expression Compiler, Proceedings of the Python for Scientific Computing Conference (SciPy), 2010.

S. Beucher, Segmentation d'images et morphologie mathématique. PhD thesis, École Nationale Supérieure des Mines de Paris, 1990.

S. Beucher and F. Meyer, The morphological approach to segmentation: the watershed transformation, Optical Engineering (New York: Marcel Dekker Incorporated), vol.34, pp.433-433, 1992.

M. Bilodeau, FREIA: FRamework for Embedded Image Applications. French ANR-funded project with ARMINES (CMM, CRI), THALES (TRT) and Télécom Bretagne, 2008.

G. Bilsen, M. Engels, R. Lauwereins, and J. Peperstraete, Cycle-static dataflow, IEEE Transactions on Signal Processing, vol.44, issue.2, pp.397-408, 1996.
DOI : 10.1109/78.485935

OpenMP Architecture Review Board, OpenMP Application Programming Interface

P. Bonnot, Definition and SIMD Implementation of a Multi-Processing Architecture Approach on FPGA, Design Automation and Test in Europe. IEEE, pp.610-615, 2008.

G. Bosilca, A generic distributed DAG engine for High Performance Computing, Parallel Computing, vol.38, issue.1-2, pp.37-51, 2012.
DOI : 10.1109/ipdps.2011.281

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.186.1874

L. Buitinck, API design for machine learning software: experiences from the scikit-learn project, ECML PKDD Workshop: Languages for Data Mining and Machine Learning, pp.108-122, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00856511

T. Chabardes, P. Dokladal, M. Faessel, and M. Bilodeau, A parallel, O(n), algorithm for unbiased, thin watershed. Working paper or preprint.
DOI : 10.1109/icip.2016.7532823

URL : https://hal.archives-ouvertes.fr/hal-01266889/document

C. Chambers, FlumeJava: easy, efficient data-parallel pipelines, 2010.

C. Clienti, Fulguro image processing library. Source Forge, 2008.

C. Clienti, S. Beucher, and M. Bilodeau, A System On Chip Dedicated To Pipeline Neighborhood Processing For Mathematical Morphology, EUSIPCO: European Signal Processing Conference, Aug. 2008.

F. Coelho and F. Irigoin, API compilation for image hardware accelerators, ACM Transactions on Architecture and Code Optimization, vol.9, issue.4, pp.1-25, 2013.
DOI : 10.1145/2400682.2400708

URL : https://hal.archives-ouvertes.fr/hal-00829650

R. Collobert, S. Bengio, and J. Mariéthoz, Torch: A Modular Machine Learning Software Library, 2002.

Microsoft Corp., C++ Accelerated Massive Parallelism. url: https

Microsoft Corp., DirectX Graphics. url: https://msdn.microsoft.com/en-us/library

NVIDIA Corp., CUDA 7 Streams Simplify Concurrency. url: https://devblogs.nvidia.com/parallelforall/gpu-pro-tip-cuda-7-streams-simplify-concurrency

NVIDIA Corp., NVIDIA CUDA Basic Linear Algebra Subroutines library. url: https

NVIDIA Corp., NVIDIA CUDA Deep Neural Network library. url: https

NVIDIA Corp., NVIDIA CUDA Fast Fourier Transform library. url: https

NVIDIA Corp., NVIDIA CUDA Toolkit. url: https://developer.nvidia.com/cuda-toolkit

NVIDIA Corp., NVIDIA GPUs - The Engine of Deep Learning. url: https

NVIDIA Corp., NVIDIA Thrust. url: https://developer.nvidia.com/thrust

CRI, MINES ParisTech, PIPS. Open Source Research Compiler, under GPLv3, 1989.

B. D. de Dinechin, D. van Amstel, M. Poulhiès, and G. Lager, Time-critical computing on a single-chip massively parallel processor, Design, Automation & Test in Europe Conference & Exhibition (DATE), 2014.

E. Dougherty, An introduction to morphological image processing, 1992.

M. Faessel, SMIL : Simple (but efficient) Morphological Image Library, 2011.

M. Faessel and M. Bilodeau, SMIL: Simple Morphological Image Library. In: Séminaire Performance et Généricité, LRDE
URL : https://hal.archives-ouvertes.fr/hal-00836117

Facebook AI Research, Learning to Segment. url: https://research.facebook.com/blog/learning-to-segment

The MPI Forum, The Message Passing Interface. url: http://www.mpi-forum.org

GNU Make. url: https://www.gnu.org/software/make

M. I. Gordon, Compiler Techniques for Scalable Performance of Stream Programs on Multicore Architectures. PhD thesis

M. I. Gordon, W. Thies, and S. Amarasinghe, Exploiting coarse-grained task, data, and pipeline parallelism in stream programs. In: ACM SIGPLAN Notices, vol.4108, 2014.

T. Goubier, R. Sirdey, S. Louise, and V. David, ΣC: A Programming Model and Language for Embedded Manycores, 2011.
DOI : 10.1007/978-3-642-24650-0_33

Khronos Group, Khronos Releases OpenCL 2.2 Provisional Specification with OpenCL C++ Kernel Language for Parallel Programming. Apr. 2016. url: https://www.khronos.org/news/press/khronos-releases-opencl-2.2-provisional-spec-opencl-c-kernel-language

Khronos Group, OpenCL: A State of the Union. Apr. 2016. url: https://www.khronos.org/assets

Khronos Group, OpenCL Computing Language v1.0. Dec. 2008.

Khronos Group, OpenCL: The open standard for parallel programming of heterogeneous systems. url: https://www.khronos.org

Khronos Group, OpenGL: The Industry's Foundation for High Performance Graphics. url: https://www.khronos.org

Khronos Group, OpenVX: Portable, Power-efficient Vision Processing. url: https

Khronos Group, SPIR-V: The first open standard intermediate language for parallel compute and graphics. url: https

Khronos Group, SYCL: C++ Single-source Heterogeneous Programming for OpenCL. url: https://www.khronos.org/sycl

Khronos Group, Vulkan - Industry Forged. url: https://www.khronos.org/vulkan

Open Group, POSIX Threads extensions (IEEE Std 1003.1c-1995). url: http

S. Guelton, Pythran: enabling static optimization of scientific Python programs, Computational Science & Discovery, vol.8, issue.1, 2015.
DOI : 10.1088/1749-4680/8/1/014001

R. Habel, Programmation haute performance pour architectures hybrides. PhD thesis, MINES ParisTech, Nov. 2014.
URL : https://hal.archives-ouvertes.fr/hal-01086632

R. Habel, F. Silber-Chaussumier, F. Irigoin, E. Brunet, and F. Trahay, Combining Data and Computation Distribution Directives for Hybrid Parallel Programming: A Transformation System, International Journal of Parallel Programming, vol.26, issue.6, pp.1268-1295, 2016.
DOI : 10.1109/IPDPS.2008.4536139

N. Halbwachs, P. Caspi, P. Raymond, and D. Pilaud, The synchronous dataflow programming language LUSTRE, Proceedings of the IEEE, pp.1305-1320, 1991.
DOI : 10.1109/5.97300

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.34.5059

Intel, Intel® AVX-512 instructions. url: https://software.intel.com/en-us/blogs, 2013.

Intel, Intel® SDK for OpenCL Applications. url: https://software

F. Irigoin, P. Jouvelot, and R. Triolet, Semantical interprocedural parallelization: an overview of the PIPS project, Proceedings of ICS 1991, pp.244-251, 1991.

Is C++ really faster than Python? url: https

P. Jääskeläinen, pocl: A Performance-Portable OpenCL Implementation, International Journal of Parallel Programming, vol.19, issue.6, pp.752-785, 2015.
DOI : 10.1007/s10766-014-0320-y

Y. Jia, Convolutional Architecture for Fast Feature Embedding, Proceedings of the 22nd ACM International Conference on Multimedia. MM '14, pp.675-678, 2014.

G. Kahn, The semantics of a simple language for parallel programming, p.5, 1974.

A. Klöckner, PyOpenCL: OpenCL integration for Python. url: https://mathema.tician.de/software

C. Lattner and V. Adve, LLVM: A compilation framework for lifelong program analysis & transformation, International Symposium on Code Generation and Optimization, 2004. CGO 2004., 2004.
DOI : 10.1109/CGO.2004.1281665

P. Le Guernic, A. Benveniste, P. Bournai, and T. Gautier, Signal - A data flow-oriented language for signal processing, Acoustics, Speech and Signal Processing, pp.362-374

E. A. Lee and D. G. Messerschmitt, Static Scheduling of Synchronous Data Flow Programs for Digital Signal Processing, IEEE Trans. Comput., vol.36, issue.1, pp.24-35, 1987.

N. Lossing, C. Ancourt, and F. Irigoin, Automatic Code Generation of Distributed Parallel Tasks, 2016 IEEE 19th International Conference on Computational Science and Engineering (CSE), 2016.

M. Abadi, TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. Software available from tensorflow

Microsoft, Using extern to Specify Linkage. url: https://msdn.microsoft.com/en-us/library

G. E. Moore, Cramming more components onto integrated circuits, 1965.

R. T. Mullapudi, V. Vasista, and U. Bondhugula, PolyMage: Automatic Optimization for Image Processing Pipelines, pp.429-443, 2015.

OpenCV, Open Source Computer Vision. url: http://opencv.org

F. Pedregosa, Scikit-learn: Machine Learning in Python, Journal of Machine Learning Research, vol.12, pp.2825-2830, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00650905

L. Peuch, RedBaron, une approche bottom-up au refactoring en Python, 2014.

A. Pop, Leveraging Streaming for Deterministic Parallelization - an Integrated Language, Compiler and Runtime Approach. PhD thesis, MINES ParisTech, Sept. 2011.

A. Pop and A. Cohen, OpenStream, ACM Transactions on Architecture and Code Optimization, vol.9, issue.4, pp.1-53
DOI : 10.1145/2400682.2400712

URL : https://hal.archives-ouvertes.fr/hal-00786675

J. Ragan-Kelley, A Language and Compiler for Optimizing Parallelism, Locality, and Recomputation in Image Processing Pipelines, 2013.

A. Rubinsteyn, E. Hielscher, N. Weinman, and D. Shasha, Parakeet: A Just-In-Time Parallel Accelerator for Python, 2012.

J. Serra, Image analysis and mathematical morphology, 1982.

R. M. Stallman and the GCC Developer Community, Using the GNU Compiler Collection, 2016.

H. Sutter, Welcome to the Jungle. url: http://herbsutter.com/welcome-to-the-jungle, 2011.

Xilinx, SDAccel Development Environment. url: http://www.xilinx.com/products/design-tools/software-zone/sdaccel

P. Guillou, Compilation d'applications de traitement d'images sur architecture MPPA-Manycore. Conférence d'informatique en Parallélisme, Architecture et Système, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01096993

P. Guillou, Compiling Image Processing Applications for Many-Core Accelerators, ACACES Summer School: Eleventh International Summer School on Advanced Computer Architecture and Compilation for High-Performance and Embedded Systems. Poster, 2015.

P. Guillou, smilc: A C wrapper for SMIL. url: https

P. Guillou, F. Coelho, and F. Irigoin, Automatic Streamization of Image Processing Applications, Parasol. Hillsboro, United States
DOI : 10.1007/978-3-319-17473-0_15

URL : https://hal.archives-ouvertes.fr/hal-01096735

P. Guillou, B. Pin, F. Coelho, and F. Irigoin, A Dynamic to Static DSL Compiler for Image Processing Applications, 19th Workshop on Compilers for Parallel Computing, 2016.

N. Lossing, P. Guillou, M. Amini, and F. Irigoin, From Data to Effects Dependence Graphs: Source-to-Source Transformations for C, 18th International Workshop on Compilers for Parallel Computing, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01254426

N. Lossing, P. Guillou, and F. Irigoin, Effects Dependence Graph: A Key Data Concept for C Source-to-Source Compilers, 2016 IEEE 16th International Working Conference on Source Code Analysis and Manipulation (SCAM), pp.167-176, 2016.
DOI : 10.1109/SCAM.2016.20

URL : https://hal.archives-ouvertes.fr/hal-01359465

B. Pin, P. Guillou, and F. Coelho, smiltofreia: A SMIL Python to FREIA C compiler. url: https