One of the most difficult problems in the field of genomics is assembling relatively short "reads" of DNA into complete chromosomes. In a new paper published in Proceedings of the National Academy of Sciences an interdisciplinary group of genome and computer scientists has solved this problem, creating an algorithm that can rapidly create "virtual chromosomes" with no prior information about how the genome is organized.
The powerful DNA sequencing methods developed about 15 years ago, known as next generation sequencing (NGS) technologies, create thousands of short fragments. In species whose genetics has already been extensively studied, existing information can be used to organize and order the NGS fragments, rather like using a sketch of the complete picture as a guide to a jigsaw puzzle. But as genome scientists push into less-studied species, it becomes more difficult to finish the puzzle.
To solve this problem, a team led by Harris Lewin, distinguished professor of evolution and ecology and vice chancellor for research at the University of California, Davis and Jian Ma, assistant professor at the University of Illinois at Urbana-Champaign created a computer algorithm that uses the known chromosome organization of one or more known species and NGS information from a newly sequenced genome to create virtual chromosomes.
"We show for the first time that chromosomes can be assembled from NGS data without the aid of a preexisting genetic or physical map of the genome," Lewin said.
The new algorithm will be very useful for large-scale sequencing projects such as G10K, an effort to sequence 10,000 vertebrate genomes of which very few have a map, Lewin said.
"As we have shown previously, there is much to learn about phenotypic evolution from understanding how chromosomes are organized in one species relative to other species," he said.
The algorithm is called RACA (for reference-assisted chromosome assembly), co-developed by Jaebum Kim, now at Konkuk University, South Korea, and Denis Larkin of Aberystwyth University, Wales. Kim wrote the software tool which was evaluated using simulated data, standardized reference genome datasets as well as a primary NGS assembly of the newly sequenced Tibetan antelope genome generated by BGI (Shenzhen, China) in collaboration with Professor Ri-Li Ge at Qinghai University, China. Larkin led the experimental validation, in collaboration with scientists at BGI, proving that predictions of chromosome organization were highly accurate.
Ma said that the new RACA algorithm will perform even better as developing NGS technologies produce longer reads of DNA sequence.
"Even with what is expected from the newest generation of sequencers, complete chromosome assemblies will always be a difficult technical issue, especially for complex genomes. RACA predictions address this problem and can be incorporated into current NGS assembly pipelines," Ma said.
University of California - Davis: http://www.ucdavis.edu
This press release was posted to serve as a topic for discussion. Please comment below. We try our best to only post press releases that are associated with peer reviewed scientific literature. Critical discussions of the research are appreciated. If you need help finding a link to the original article, please contact us on twitter or via e-mail.
Corals stir up the water, creating vortices that draw in nutrients and drive away waste, research reveals.
The "gold bowl of Hasanlu" and three skeletons were excavated from beneath a burned building in an ancient Iranian citadel – now we know the full story
Study of engravings in Gibraltar cave could be final nail in the coffin of hypothesis that Neanderthals were cognitively inferior to modern humans
Claims that Ai Hin was faking pregnancy to get better treatment have been debunked by leading panda expert
The recent release of Susan Greenfields new book and the film Lucy, both of which are dependent on tired misconceptions or dubious theories about the brain, suggest one worrying conclusion: we are running out of myths about the brain. So here are some new ones, to keep things mysterious
These are the siphonophores, some 180 known species of gelatinous strings that can grow to 100 feet long, making them some of the longest critters on the planet. But instead of growing as a single body like virtually every other animal, siphonophores clone themselves thousands of times over into half a dozen different types of specialized cloned bodies, all strung together to work as a team---a very deadly team at that.
Researchers who study memory have had a thrilling couple of years. Some have erased memories in people with electroshock therapy, for example. Others have figured out, in mice, how to create false memories and even turn bad memories into good ones.
Hunting bats don't just listen out for male frogs' mating calls: they can also use echolocation to detect when the frogs inflate their throat sacs
A crèche of 30 dinosaur infants looked over by an older animal shows that even terrible lizards needed a night away from the kids
Families have identifiable collections of microbes that travel with them. It can take just 24 hours for the microbes to take over a new house