Title :
A Multi-objective Optimisation Approach to the Design of Experiment in De Novo Assembly Projects
Author :
Nadalin, Francesca ; Vezzi, Francesco ; Policriti, Alberto
Author_Institution :
Dipt. di Mat. ed Inf., Univ. degli studi di Udine, Udine, Italy
Abstract :
Genomics projects are characterised by difficult biological pipelines and high sequencing costs. In particular, de novo assembly projects must go through data production, assembly, and results validation. Early mistakes in the first (and most expensive) step can therefore be detected only at a very late stage and have serious consequences. Our goal is to design a pipeline able to provide the users with the optimal input for the sequencing experiments within a de novo assembly project. We present a new approach, based on multi-objective optimisation, aiming at transforming the design of genomics experiments from a set of "best practices" to an algorithmically controlled procedure. We implemented our model with mode FRONTIER and we show how our method can be used to infer the final quality of a whole genome assembly project from the results obtained on a small but representative sample.
Keywords :
DNA; Pareto optimisation; bioinformatics; genomics; Pareto optimisation; data production; de novo assembly projects; genome assembly project; genomics experiment design; modeFRONTIER; multiobjective optimisation approach; optimal input; result validation; sequencing costs; Assembly; Bioinformatics; Genomics; Instruments; Optimization; Pipelines; Production; Pareto optima; de novo assembly; multi-objective optimisation; sequencing technologies;
Conference_Titel :
Database and Expert Systems Applications (DEXA), 2012 23rd International Workshop on
Conference_Location :
Vienna
Print_ISBN :
978-1-4673-2621-6
DOI :
10.1109/DEXA.2012.42