- Projets
- environment
- i-bio ontological management project of distributed bioinfor
i-bio ontological management project of distributed bioinformatics resources
Biologists are now dealing with new forms of data, such as images (e.g.: DNA chips), or with large sequences (genomes). Widespread use of the Internet and information technologies has changed the working methods of researchers. A scientist’s environment now integrates distributed IT resources (access to data and applications), which poses a range of problems regarding information validation and extraction. This project takes account of the shift from information systems to the integration of distributed resources (web services). It will also involve preparing new forms of Internet such as computing grids. The Internet and Biology integration project is called i-bio.
We have identified three types of systematically handled objects during bioinformatics experiments: data, application software and protocols. We are hoping to work on meta-models describing these objects on the basis of structures that are already published (BSML, OPM, etc.). There are also dependency rules between these objects, such as those provided by the meta-data standards (ex. Topic Maps). But the purpose of i-bio is to show that the introduction of the ontology concept is a decisive stage in the organisation of information and management of IT resources. Throughout the LifeGrid call for tender, the project aims to produce a prototype fulfilling a function of the Clermont-Ferrand Modelling Unit for biological research laboratories in the region. The i-bio project combines IT development and research. We are expecting to take on a post-PhD student and an IT engineer (LifeGrid) to produce the prototype.
The i-bio project combines the development of a database and of two interfaces: the first is for users and the second for services (particularly web services). The user interface will be a graphics program, distributed on individual work stations and developed using technology such as JAVA. The other is a connector between the database and research tools on distributed resources. This project will be for use by the public and the sources will be open as it integrates open application software.
We have defined four biological applications to validate the architecture of i-bio. These will instantiate the database. Two of these case studies concern the development of scientific websites: “minimum proteome” and “Hoplantgen”. Both projects contain a lot of different information and provide the international community with a service thanks to scientific added value. These projects incorporate updating protocols. One case study focuses on data processing and extraction for a metagenome experiment. The last biology application concerns the creation of a monitoring facility for an ontology describing a specific gene context in parasitology.
LifeGrid, the regional information system