P-arch - Digital Repository

Automated and traceable processing for large-scale high-throughput sequencing facilities

Mostra i principali dati

dc.contributor.author Pireddu, Luca
dc.contributor.author Cuccuru, Gianmauro
dc.contributor.author Lianas, Luca
dc.contributor.author Vocale, Matteo
dc.contributor.author Fotia, Giorgio
dc.contributor.author Zanetti, Gianluigi
dc.date.accessioned 2014-05-16T07:53:52Z
dc.date.available 2014-05-16T07:53:52Z
dc.date.issued 2013
dc.identifier.issn 2226-6089
dc.identifier.uri http://hdl.handle.net/11050/908
dc.description.abstract Scaling up production in medium and large high-throughput sequencing facilities presents a number of challenges. As the rate of samples to process increases, manually performing and tracking the center’s operations becomes increasingly difficult, costly and error prone, while processing the massive amounts of data poses significant computational challenges. We present our ongoing work to automate and track all data-related procedures at the CRS4 Sequencing and Genotyping Platform, while integrating state-of-the-art processing technologies such as Hadoop, OMERO, iRODS, and Galaxy into our automated workflows. Currently, the core system is in its testing phase and it is on schedule to be in production use at CRS4 by May 2013. The results thus far obtained are encouraging and the authors are confident that the CRS4 Platform will increase its efficiency and capacity thanks to this system. In the near future, the integration components will be released as as open source software. IT
dc.language.iso en IT
dc.relation.ispartof EMBnet.journal. The Next NGS Challenge Conference: Data Processing and Integration 14-16 May 2013, Valencia, Spain IT
dc.relation.ispartofseries 19;Suppl. A
dc.rights Attribuzione - Non commerciale - Condividi allo stesso modo 3.0 Italia *
dc.rights.uri http://creativecommons.org/licenses/by-nc-sa/3.0/it/ *
dc.subject ngs IT
dc.subject automation IT
dc.subject bioinformatics IT
dc.subject data analysis IT
dc.subject high-performance computing IT
dc.title Automated and traceable processing for large-scale high-throughput sequencing facilities IT
dc.type Articolo IT
dc.description.pagenumber 23-24 IT
dc.description.status Pubblicato IT
dc.identifier.doi 10.14806/ej.19.A.626 IT
dc.subject.een-cordis EEN CORDIS::SCIENZE BIOLOGICHE ::Ricerca sul genoma ::Bioinformatica IT


File allegati

I seguenti file di Licenza sono associati a questo inserimento:

Questo inserimento fa parte delle seguenti collezioni

Mostra i principali dati

Attribuzione - Non commerciale - Condividi allo stesso modo 3.0 Italia Attribuzione - Non commerciale - Condividi allo stesso modo 3.0 Italia