J. Allard, S. Cotin, F. Faure, P.-J. Bensoussan, F. Poyer, C. Duriez, H. Delingette, and L. Grisoni. Sofa? an open source framework for medical simulation. In Medicine Meets Virtual Reality (MMVR), 2007.
T. E. Anderson, B. N. Bershad, E. D. Lazowska, and H. M. Levy. Scheduler activations: effective kernel support for the user-level management of parallelism. In SOSP '91: Proceedings of the thirteenth ACM symposium on Operating systems principles, pages 95-109, New York, NY, USA, 1991. ACM Press.
X. Besseron, S. Jafar, T. Gautier, and J.-L. Roch. Cck: An improved coordinated checkpoint/rollback protocol for dataflow applications in kaapi. In IEEE, editor, Proceedings of the IEEE Conference on Information and Communication Technologies (ICTTA '06): from Theory to Applications, pages 3353-3358, Damascus, Syria, April 2006.
G.E. Blelloch. NESL: A Nested Data-Parallel Language. Technical Report CMU-CS-93-129, April 1993.
R. Blumofe and C. Leiserson. Scheduling multithreaded computations by work stealing. In Proceedings of the 35th Annual Symposium on Foundations of Computer Science, Santa Fe, New Mexico., pages 356-368, November 1994.
R. D. Blumofe and P. A. Lisiecki. Adaptive and reliable parallel computing on networks of workstations. In Proceedings of the USENIX 1997 Annual Technical Conference on UNIX and Advanced Computing Systems, pages 133-147, Anaheim, California, January 1997.
R.D. Blumofe, C.F. Joerg, B.C. Kuszmaul, C.E. Leiserson, K.H. Randall, and Y. Zhou. Cilk: An efficient multithreaded runtime system. Journal of Parallel and Distributed Computing, 37(1):55-69, 1996.
R.D. Blumofe and C.E. Leiserson. Space-efficient scheduling of multithreaded computations. SIAM Journal on Computing, 1(27):202-229, 1997.
D.E Culler and Arvind. Resource requirements of dataflow programs. In Proceedings of the 15th Annual International Symposium on Computer Architecture, pages 141-150, Honolulu, Hawai, 1989.
V. Danjean, R. Gillard, S. Guelton, J.-L. Roch, and T. Roche. Adaptive Loops with Kaapi on Multicore and Grid: Applications in Symmetric Cryptography. In Proceedings of the Parallel Symbolic Computation (PA SCO '07), 2007.
J.-G. Dumas, T. Gautier, M. Giesbrecht, P. Giorgi, B. Hovinen, E. Kaltofen, B.D. Saunders, W.J. Turner, and G. Villard. Linbox: A generic library for exact linear algebra. In Proceedings of the International Congress of Mathematical Software. (ICMS'02), Beijing, China, pages 40-50. World Scientific, 2002.
P. Fatourou and P.G. Spirakis. Efficient scheduling of strict multithreaded computations. Theory of Computing Systems, 33(3):173-232, 2000.
H. Franke, R. Russell, and M. Kirkwood. Fuss, futexes and furwocks: Fast userlevel locking in linux. In Proceedings of the Ottawa Linux Symposium, 2002.
M. Frigo, C.E. Leiserson, and K.H. Randall. The implementation of the cilk-5 multithreaded language. In Sigplan'98, pages 212-223, 1998.
F. Galilée, J.-L. Roch, G. Cavalheiro, and M. Doreille. Athapascan-1: On-line building data flow graph in a parallel language. In IEEE, editor, Pact'98, pages 88-95, Paris, France, October 1998.
T. Gautier, R. Revire, and Roch. Athapascan: Api for asynchronous parallel programming. Technical Report RR-0276, APACHE, INRIA Rhône-Alpes, February 2003.
T. Gautier, J.-L. Roch, and F. Wagner. Fine grain distributed implementation of a dataflow language with provable performances. In IEEE, editor, Workshop PAPP 2007 - Practical Aspects of High-Level Parallel Programming in International Conference on Computational Science 2007 (ICCS2007), Beijing, China, May 2007.
Grid5000. http://www.grid5000.org.
L. J. Hendren, G. R. Gao, X. Tang, Y Zhu, X. Xue, H. Cai, and P. Ouellet. Compiling c for the earth multithreaded architecture. In IEEE, editor, Pact'96, pages 12-23, Boston, USA, 1996.
High Performance Fortran Forum. High Performance Fortran language specification, version 1.0. Technical Report CRPC-TR.92225, Houston, Tex., 1993.
J.-J. Hwang, Y.-C. Chow, F. D. Anger, and C.-Y. Lee. Scheduling precedence graphs in systems with interprocessor communication times. SIAM J. Comput., 18(2):244-257, 1989.
S. Jafar, T. Gautier, A. Krings, and J-L. Roch. A checkpoint/recovery model for heterogeneous dataflow computations using work-stealing. In Proceedings of (LNCS) EuroPar'05, Lisboa, Portugal, August 2005.
S. Jafar, A. Krings, T. Gautier, and J-L. Roch. Theft-induced checkpointing for reconfigurable dataflow applications. In Proceedings of the IEEE Electro/Information Technology Conference EIT2005, Lincoln, Nebraska,U.S.A., May 2005.
S. Jafar, L. Pigeon, T. Gautier, and J.-L. Roch. Self-adaptation of parallel applications in heterogeneous and dynamic architectures. In IEEE, editor, Proceedings of the IEEE Conference on Information and Communication Technologies (ICTTA '06): from Theory to Applications, pages 3347-3352, Damascus, Syria, April 2006.
Kaapi. http://kaapi.gforge.inria.fr.
G. Karypis and V. Kumar. Analysis of multilevel graph partitioning. In Supercomputing '95: Proceedings of the 1995 ACM/IEEE conference on Supercomputing (CDROM), page 29, New York, NY, USA, 1995. ACM Press.
A. Mainwaring and D. Culler. Active message applications programming interface and communication subsystem organization. Technical Report CSD-96-918.
C J. Morrone, J. N. Amaral, G. Tremblay, and G. R. Gao. A Multi-Threaded Runtime System for a Multi-Processor/Multi-Node Cluster. In Kluwer Academic, editor, 15th Annual International Symposium on High Performance Computing Systems and Applications, pages 18-20, Windsor, ON, Canada, 2001.
G.J. Narlikar. Scheduling threads for low space requirement and good locality. Number TR CMU-CS-99-121, may 1999. Extended version of the paper published in Spaa'99.
Institute of Electrical and Inc. Electronic Engineers. Information Technology - Portable Operating Systems Interface (POSIX) - Part: System Application Program Interface (API) - Amendment 2: Threads Extension [C Language]. IEEE Standard 1003.1c-1995, IEEE, New York, NY, 1995.
F. Pellegrini and J. Roman. Experimental analysis of the dual recursive bipartitioning algorithm for static mapping. Technical Report 1038-96, 1996.
M. L. Powell, Steve R. Kleiman, S. Barton, D. Shah, D. Stein, and M. Weeks. Sunos multi-thread architecture. In USENIX Winter, pages 65-80, 1991.
R. Revire, F. Zara, and T. Gautier. Efficient and easy parallel implementation of large numerical simulation. In Springer, editor, Proceedings of ParSim03 of EuroPVM/MPI03, pages 663-666, Venice, Italy, 2003.
M.C. Rinard and M.S. Lam. The design, implementation, and evaluation of Jade. ACM Trans. Programming Languages and Systems, 20(3):483-545, 1998.
X. Tang, J. Wang, K. B. Theobald, and G. R. Gao. Thread partitioning and scheduling based on cost model. In ACM Symposium on Parallel Algorithms and Architectures, pages 272-281, 1997.
R. Vuduc, J. W. Demmel, and K. A. Yelick. OSKI: A library of automatically tuned sparse matrix kernels. Journal of Physics Conference Series, 16:521-530, January 2005.
T. Yang and A. Gerasoulis. DSC: Scheduling Parallel Tasks on an Unbounded Number of Processors. IEEE Trans. Parallel Distrib. Syst., 5(9):951-967, 1994.