R&D Engineer in Fault Tolerant Distributed Systems
|
|
| Reference |
|
|
| Job Type |
Full-time |
|
| Job Status |
Sourcing |
|
| Date Posted |
Tuesday, 22 January 2008 |
|
| Location |
|
|
| Duration |
|
|
| Company Information |
IRISA/INRIA
Campus universitaire de Beaulieu
Rennes cedex,
35042
Website: http://www.irisa.fr/
|
|
| Job Description |
This position is proposed in the framework of the XtreemOS European FP6 Integrated Project (http://www.xtreemos.eu). XtreemOS project aims at building and promoting a Linux-based Operating System to Support Virtual Organizations for Next Generation Grids. INRIA is the scientific coordinator of XtreemOS project. The consortium is composed of 19 industrial and academic partners from 8 different countries (7 European countries and China).
The R&D engineer will participate in the research and development activities related to the design and implementation of a Grid operating system that will be conducted by INRIA Rennes within the XtreemOS project.
The research and development engineer will participate in the design and implementation of XtreemOS, a Grid system based on Linux operating system.
She/he will be in charge of the activities on fault tolerance INRIA is responsible for in the framework of XtreemOS. The engineer will contribute to the design and implementation of a Grid checkpointer for Grid applications. In particular, she/he will participate in the following tasks:
- Specification of the global architecture of XtreemOS checkpointing service for Grid applications (distributed applications comprising of sequential or parallel application units executing on different Grid nodes),
- Design and implementation of this architecture at three levels:
• Grid level (extend the job description language to express the fault tolerance strategy, coordinate the local checkpointers executed on Grid nodes for checkpointing a Grid application spanning multiple Grid nodes, integrate the checkpointing service in the application execution management service),
• System level (ensure that the defined fault tolerance strategy for a given application is implemented on the Grid nodes executing it),
• Kernel level to checkpoint application units on individual Grid nodes (extend the existing BLCR Linux process checkpointer to meet XtreemOS needs),
- Experimentation and validation of the implemented architecture and mechanisms.
In addition to the software design and development, the engineer will write documentations (installation, user, reference manuals) and contribute to the deliverables (reports and prototypes). She/he will participate to technical meetings and to the promotion of the XtreemOS software (presenting demonstrations, posters, tutorials).
BLCR is developed as Linux kernel modules developed in C. XtreemOS Grid checkpointer is developed in Java. Experimentations will be carried out on the Grid 5000 experimental Grid platform (http://www.grid5000.fr) consisting of PC clusters located in 9 different sites (INRIA – Rennes being one of the Grid 5000 sites with more than 500 processors).
The engineer will interact with engineers, researchers and PhD students working on the design and implementation of operating systems for clusters and Grids within the PARIS project-team at INRIA Rennes. She/he will also interact with the other industrial and academic partners of the XtreemOS consortium for the XtreemOS development and experimentation. Finally, she/he will interact with BLCR developers from Berkeley National Lab (USA). |
|
| Qualifications |
Profile
PhD degree in Computer Science or engineering or master degree in Computer Science + experience
Required expertise
- Good knowledge in operating systems and distributed systems design and implementation
- Knowledge of Linux kernel (experience in Linux kernel module development would be very much appreciated)
- Good knowledge of C and Java programming language
- Knowledge in Grid computing
- Knowledge in fault tolerance
- Experience with Linux and open source software projects would be very much appreciated
- Experience in software development in a large project would be appreciated
- Very good knowledge of English (read, spoken, written) demanded
- Ability and liking for team work and relationships
- Rigor, autonomy, organizational skills
- Good writing ability |
|
| How to Apply |
Contact
Christine Morin
XtreemOS project leader
PARIS team
IRISA/INRIA
Campus universitaire de Beaulieu
35042 Rennes cedex (France)
Email: Christine.Morin@irisa.fr
Phone: +33 2 99 84 72 90
Application
Applicants should send their application file including a motivation letter, a full resume and up to five reference letters to
Sandrine L’Hermitte
IRISA/UR INRIA Rennes
Campus de Beaulieu
35042 Rennes cedex (France)
Email: Sandrine.L_Hermitte@irisa.fr |
|
| Email Resume To |
Christine.Morin@irisa.fr |
|
| Contact Info |
Christine Morin
Contact Phone: +33 2 99 84 72 90
Contact Email: Christine.Morin@irisa.fr
|
|
|
|
| Go back to last page |