Large-Scale Machine Learning on Heterogeneous Distributed Systems ,
An Introduction to Computational Networks and the Computational Network Toolkit ,
url : https://www.altera.com/products/design-software/embedded-software-developers ,
Source-to-Source Automatic Program Transformations for GPU-like Hardware Accelerators, PhD thesis, MINES ParisTech, Dec. 2012. ,
url : https://www.arm.com/products/processors/technologies/biglittleprocessing ,
Extended Cyclostatic Dataflow Program Compilation and Execution for an Integrated Manycore Processor, ed. Peter M. A. Sloot et al., Procedia Computer Science, vol.18, pp.1624-1633, 2013. ,
DOI : 10.1016/j.procs.2013.05.330
URL : https://hal.archives-ouvertes.fr/hal-00832504
Theano: new features and speed improvements, Deep Learning and Unsupervised Feature Learning NIPS 2012 Workshop, 2012. ,
Cython: The Best of Both Worlds, Computing in Science & Engineering, vol.13, issue.2, pp.31-39, 2011. ,
DOI : 10.1109/MCSE.2010.118
Theano: a CPU and GPU Math Expression Compiler, Proceedings of the Python for Scientific Computing Conference (SciPy), 2010. ,
Segmentation d'images et morphologie mathématique, PhD thesis, École Nationale Supérieure des Mines de Paris, 1990. ,
The morphological approach to segmentation: the watershed transformation, Optical Engineering, vol.34, Marcel Dekker, New York, pp.433-433, 1992. ,
FREIA: FRamework for Embedded Image Applications. French ANR-funded project with ARMINES (CMM, CRI), THALES (TRT) and Télécom Bretagne, 2008. ,
Cyclo-static dataflow, IEEE Transactions on Signal Processing, vol.44, issue.2, pp.397-408, 1996. ,
DOI : 10.1109/78.485935
OpenMP Application Programming Interface ,
Definition and SIMD Implementation of a Multi-Processing Architecture Approach on FPGA, Design Automation and Test in Europe. IEEE, pp.610-615, 2008. ,
A generic distributed DAG engine for High Performance Computing, Parallel Computing, vol.381, issue.2, pp.37-51, 2012. ,
DOI : 10.1109/ipdps.2011.281
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.186.1874
API design for machine learning software: experiences from the scikit-learn project, ECML PKDD Workshop: Languages for Data Mining and Machine Learning, pp.108-122, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00856511
A parallel, O(n), algorithm for unbiased, thin watershed, working paper or preprint ,
DOI : 10.1109/icip.2016.7532823
URL : https://hal.archives-ouvertes.fr/hal-01266889/document
FlumeJava: easy, efficient data-parallel pipelines, 2010. ,
Fulguro image processing library. SourceForge, 2008. ,
A System On Chip Dedicated To Pipeline Neighborhood Processing For Mathematical Morphology, EUSIPCO: European Signal Processing Conference, Aug. 2008. ,
API compilation for image hardware accelerators, ACM Transactions on Architecture and Code Optimization, vol.9, issue.4, pp.1-25, 2013. ,
DOI : 10.1145/2400682.2400708
URL : https://hal.archives-ouvertes.fr/hal-00829650
Torch: A Modular Machine Learning Software Library, 2002. ,
C++ Accelerated Massive Parallelism. url : https ,
DirectX Graphics. url : https://msdn.microsoft.com/en-us/library ,
CUDA 7 Streams Simplify Concurrency. url : https://devblogs.nvidia.com/parallelforall/gpu-pro-tip-cuda-7-streams-simplify-concurrency ,
NVIDIA CUDA Basic Linear Algebra Subroutines library. url : https ,
NVIDIA CUDA Deep Neural Network library. url : https ,
NVIDIA CUDA Fast Fourier Transform library. url : https ,
url : https://developer.nvidia.com/cuda-toolkit ,
NVIDIA GPUs - The Engine of Deep Learning. url : https ,
url : https://developer.nvidia.com/thrust ,
PIPS. Open Source Research Compiler, under GPLv3, 1989. ,
Time-critical computing on a single-chip massively parallel processor, Design, Automation & Test in Europe Conference & Exhibition (DATE), pp.2014-2015, 2014. ,
An introduction to morphological image processing, 1992. ,
SMIL: Simple (but efficient) Morphological Image Library, 2011. ,
Simple Morphological Image Library, in: Séminaire Performance et Généricité, LRDE ,
URL : https://hal.archives-ouvertes.fr/hal-00836117
Learning to Segment. url : https://research.facebook.com/blog/learning-to-segment ,
The Message Passing Interface. url : http://www.mpi-forum.org ,
url : https://www.gnu.org/software/make ,
Compiler Techniques for Scalable Performance of Stream Programs on Multicore Architectures, PhD thesis ,
Exploiting coarse-grained task, data, and pipeline parallelism in stream programs, in: ACM SIGPLAN Notices, vol.4108, 2014. ,
ΣC: A Programming Model and Language for Embedded Manycores, 2011. ,
DOI : 10.1007/978-3-642-24650-0_33
Khronos Releases OpenCL 2.2 Provisional Specification with OpenCL C++ Kernel Language for Parallel Programming. Apr. 2016. url : https://www.khronos.org/news/press/khronos-releases-opencl-2.2-provisional-spec-opencl-c-kernel-language ,
OpenCL: A State of the Union. Apr. 2016. url : https://www.khronos.org/assets ,
OpenCL Computing Language v1.0. Dec. 2008. ,
OpenCL: The open standard for parallel programming of heterogeneous systems. url : https://www.khronos.org ,
OpenGL: The Industry's Foundation for High Performance Graphics. url : https://www.khronos.org ,
OpenVX: Portable, Power-efficient Vision Processing. url : https ,
SPIR-V: The first open standard intermediate language for parallel compute and graphics. url : https ,
SYCL: C++ Single-source Heterogeneous Programming for OpenCL. url : https://www.khronos.org/sycl ,
Vulkan - Industry Forged. url : https://www.khronos.org/vulkan ,
Threads extensions (IEEE Std 1003.1c-1995). url : http ,
Pythran: enabling static optimization of scientific Python programs, Computational Science & Discovery, vol.8, issue.1, 2015. ,
DOI : 10.1088/1749-4680/8/1/014001
Programmation haute performance pour architectures hybrides, PhD thesis, MINES ParisTech, Nov. 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01086632
Combining Data and Computation Distribution Directives for Hybrid Parallel Programming: A Transformation System, International Journal of Parallel Programming, vol.26, issue.6, pp.1268-1295, 2016. ,
DOI : 10.1109/IPDPS.2008.4536139
The synchronous dataflow programming language LUSTRE, Proceedings of the IEEE, pp.1305-1320, 1991. ,
DOI : 10.1109/5.97300
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.34.5059
Intel® AVX-512 instructions. url : https://software.intel.com/en-us/blogs, p.512, 2013. ,
url : https://software ,
Semantical interprocedural parallelization: an overview of the PIPS project, Proceedings of ICS 1991, pp.244-251, 1991. ,
C++ really faster than Python? url : https ,
pocl: A Performance-Portable OpenCL Implementation, International Journal of Parallel Programming, vol.19, issue.6, pp.752-785, 2015. ,
DOI : 10.1007/s10766-014-0320-y
Convolutional Architecture for Fast Feature Embedding, Proceedings of the 22nd ACM International Conference on Multimedia. MM '14, pp.675-678, 2014. ,
The semantics of a simple language for parallel programming, p.5, 1974. ,
OpenCL integration for Python. url : https://mathema.tician.de/software ,
LLVM: A compilation framework for lifelong program analysis & transformation, International Symposium on Code Generation and Optimization (CGO 2004), 2004. ,
DOI : 10.1109/CGO.2004.1281665
Signal - A data flow-oriented language for signal processing, Acoustics, Speech and Signal Processing, pp.362-374 ,
Static Scheduling of Synchronous Data Flow Programs for Digital Signal Processing, IEEE Trans. Comput, vol.36, issue.1, pp.24-35, 1987. ,
Corinne Ancourt and François Irigoin. Automatic Code Generation of Distributed Parallel Tasks, Computational Science and Engineering (CSE), 2016 IEEE 19th International Conference on, 2016. ,
TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. Software available from tensorflow ,
Using extern to Specify Linkage. url : https://msdn.microsoft.com/en-us/library ,
Cramming more components onto integrated circuits, 1965. ,
Automatic Optimization for Image Processing Pipelines, pp.429-443, 2015. ,
Open Source Computer Vision. url : http://opencv.org ,
Scikit-learn: Machine Learning in Python, Journal of Machine Learning Research, vol.12, pp.2825-2830, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00650905
RedBaron, une approche bottom-up au refactoring en Python, 2014. ,
Leveraging Streaming for Deterministic Parallelization - an Integrated Language, Compiler and Runtime Approach, PhD thesis, MINES ParisTech, Sept. 2011. ,
OpenStream, ACM Transactions on Architecture and Code Optimization, vol.9, issue.4, pp.1-53 ,
DOI : 10.1145/2400682.2400712
URL : https://hal.archives-ouvertes.fr/hal-00786675
A Language and Compiler for Optimizing Parallelism, Locality, and Recomputation in Image Processing Pipelines, pp.2013-2025, 2013. ,
Parakeet: A Just-In-Time Parallel Accelerator for Python, 2012. ,
Image analysis and mathematical morphology, 1982. ,
Using The Gnu Compiler Collection, 2016. ,
Welcome to the Jungle. url : http://herbsutter.com/welcome-to-the-jungle, 2011. ,
SDAccel Development Environment. url : http://www.xilinx.com/products/design-tools/software-zone/sdaccel ,
Compilation d'applications de traitement d'images sur architecture MPPA-Manycore. Conférence d'informatique en Parallélisme, Architecture et Système, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01096993
Compiling Image Processing Applications for Many-Core Accelerators, ACACES Summer School: Eleventh International Summer School on Advanced Computer Architecture and Compilation for High-Performance and Embedded Systems. Poster, 2015. ,
smilc: A C wrapper for SMIL. url: https ,
Automatic Streamization of Image Processing Applications, Parasol. Hillsboro, United States ,
DOI : 10.1007/978-3-319-17473-0_15
URL : https://hal.archives-ouvertes.fr/hal-01096735
Fabien Coelho and François Irigoin. A Dynamic to Static DSL Compiler for Image Processing Applications, 19th Workshop on Compilers for Parallel Computing, 2016. ,
From Data to Effects Dependence Graphs: Source-to-Source Transformations for C, 18th International Workshop on Compilers for Parallel Computing, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01254426
Effects Dependence Graph: A Key Data Concept for C Source-to-Source Compilers, 2016 IEEE 16th International Working Conference on Source Code Analysis and Manipulation (SCAM), pp.167-176, 2016. ,
DOI : 10.1109/SCAM.2016.20
URL : https://hal.archives-ouvertes.fr/hal-01359465
smiltofreia: A SMIL Python to FREIA C compiler. url: https ,