One of the most difficult problems in the field of genomics is assembling relatively short "reads" of DNA into complete chromosomes. In a new paper published in the Proceedings of the National Academy of Sciences, an interdisciplinary group of genome and computer scientists reports a solution to this problem: an algorithm that can rapidly create "virtual chromosomes" with no prior information about how the genome is organized.
The powerful DNA sequencing methods developed about 15 years ago, known as next generation sequencing (NGS) technologies, create thousands of short fragments. In species whose genetics has already been extensively studied, existing information can be used to organize and order the NGS fragments, rather like using a sketch of the complete picture as a guide to a jigsaw puzzle. But as genome scientists push into less-studied species, it becomes more difficult to finish the puzzle.
To solve this problem, a team led by Harris Lewin, distinguished professor of evolution and ecology and vice chancellor for research at the University of California, Davis, and Jian Ma, assistant professor at the University of Illinois at Urbana-Champaign, created a computer algorithm that combines the known chromosome organization of one or more reference species with NGS information from a newly sequenced genome to create virtual chromosomes.
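As a rough illustration of the general idea of using a reference species as a guide, the toy Python sketch below groups assembled fragments by the reference chromosome they align to and orders them by position. It is not the RACA method itself, which the release does not describe in detail. The input format, the `order_scaffolds` function, and all scaffold names and coordinates are hypothetical and chosen only for illustration.

```python
from collections import defaultdict

# Hypothetical toy input (not real data): each fragment of the newly
# sequenced genome, with an assumed alignment to a reference chromosome.
# Fields: (scaffold_id, reference_chromosome, reference_start, strand)
ALIGNMENTS = [
    ("scaf_3", "chr1", 5_000_000, "+"),
    ("scaf_1", "chr1", 1_200_000, "-"),
    ("scaf_2", "chr2", 300_000, "+"),
    ("scaf_4", "chr1", 9_800_000, "+"),
]


def order_scaffolds(alignments):
    """Group scaffolds by reference chromosome and sort them by position.

    Returns a dict mapping each reference chromosome to an ordered list of
    (scaffold_id, strand) pairs, a very simplified "virtual chromosome".
    """
    groups = defaultdict(list)
    for scaffold_id, ref_chrom, ref_start, strand in alignments:
        groups[ref_chrom].append((ref_start, scaffold_id, strand))

    virtual = {}
    for ref_chrom, hits in groups.items():
        hits.sort()  # order by start coordinate on the reference
        virtual[ref_chrom] = [(sid, strand) for _, sid, strand in hits]
    return virtual


virtual_chromosomes = order_scaffolds(ALIGNMENTS)
for chrom in sorted(virtual_chromosomes):
    print(chrom, virtual_chromosomes[chrom])
```

A single reference ordering like this would simply copy the reference species' arrangement, including any rearrangements that do not exist in the new genome. That limitation is one reason a real tool would need additional evidence, such as the NGS information from the new genome mentioned above.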
"We show for the first time that chromosomes can be assembled from NGS data without the aid of a preexisting genetic or physical map of the genome," Lewin said.
The new algorithm will be very useful for large-scale sequencing projects such as G10K, an effort to sequence 10,000 vertebrate genomes, very few of which have a map, Lewin said.
"As we have shown previously, there is much to learn about phenotypic evolution from understanding how chromosomes are organized in one species relative to other species," he said.
The algorithm, called RACA (for reference-assisted chromosome assembly), was co-developed by Jaebum Kim, now at Konkuk University, South Korea, and Denis Larkin of Aberystwyth University, Wales. Kim wrote the software tool, which was evaluated using simulated data, standardized reference genome datasets, and a primary NGS assembly of the newly sequenced Tibetan antelope genome generated by BGI (Shenzhen, China) in collaboration with Professor Ri-Li Ge at Qinghai University, China. Larkin led the experimental validation, in collaboration with scientists at BGI, which showed that the predictions of chromosome organization were highly accurate.
Ma said that the new RACA algorithm will perform even better as developing NGS technologies produce longer reads of DNA sequence.
"Even with what is expected from the newest generation of sequencers, complete chromosome assemblies will always be a difficult technical issue, especially for complex genomes. RACA predictions address this problem and can be incorporated into current NGS assembly pipelines," Ma said.
University of California - Davis: http://www.ucdavis.edu