Title |
Identifying wrong assemblies in de novo short read primary sequence assembly contigs
|
---|---|
Published in |
Proceedings: Plant Sciences, August 2016
|
DOI | 10.1007/s12038-016-9630-0 |
Pubmed ID | |
Authors |
Vandna Chawla, Rajnish Kumar, Ravi Shankar |
Abstract |
With the advent of short-reads-based genome sequencing approaches, large number of organisms are being sequenced all over the world. Most of these assemblies are done using some de novo short read assemblers and other related approaches. However, the contigs produced this way are prone to wrong assembly. So far, there is a conspicuous dearth of reliable tools to identify mis-assembled contigs. Mis-assemblies could result from incorrectly deleted or wrongly arranged genomic sequences. In the present work various factors related to sequence, sequencing and assembling have been assessed for their role in causing mis-assembly by using different genome sequencing data. Finally, some mis-assembly detecting tools have been evaluated for their ability to detect the wrongly assembled primary contigs, suggesting a lot of scope for improvement in this area. The present work also proposes a simple unsupervised learning-based novel approach to identify mis-assemblies in the contigs which was found performing reasonably well when compared to the already existing tools to report mis-assembled contigs. It was observed that the proposed methodology may work as a complementary system to the existing tools to enhance their accuracy. |
X Demographics
As of 1 July 2024, you may notice a temporary increase in the numbers of X profiles with Unknown location. Click here to learn more.
Geographical breakdown
Country | Count | As % |
---|---|---|
Netherlands | 2 | 13% |
Australia | 2 | 13% |
United Kingdom | 2 | 13% |
France | 1 | 6% |
Norway | 1 | 6% |
Finland | 1 | 6% |
United States | 1 | 6% |
Unknown | 6 | 38% |
Demographic breakdown
Type | Count | As % |
---|---|---|
Scientists | 11 | 69% |
Members of the public | 5 | 31% |
Mendeley readers
Geographical breakdown
Country | Count | As % |
---|---|---|
Netherlands | 1 | 3% |
Norway | 1 | 3% |
Unknown | 30 | 94% |
Demographic breakdown
Readers by professional status | Count | As % |
---|---|---|
Student > Master | 10 | 31% |
Researcher | 7 | 22% |
Lecturer | 2 | 6% |
Student > Bachelor | 2 | 6% |
Student > Ph. D. Student | 2 | 6% |
Other | 3 | 9% |
Unknown | 6 | 19% |
Readers by discipline | Count | As % |
---|---|---|
Agricultural and Biological Sciences | 11 | 34% |
Biochemistry, Genetics and Molecular Biology | 9 | 28% |
Immunology and Microbiology | 2 | 6% |
Computer Science | 1 | 3% |
Medicine and Dentistry | 1 | 3% |
Other | 1 | 3% |
Unknown | 7 | 22% |