Amazon Web Services and the United States National Institutes of Health announced that the complete 1000 Genomes Project is available on AWS as a publicly available data set. This makes the largest collection of human genetics available to researchers worldwide, free of charge.
The 1000 Genomes Project is an international research effort coordinated by a consortium of 75 companies and organizations to establish the most detailed catalog of human genetic variation. The project has grown to 200 terabytes of genomic data, including DNA sequenced from more than 1,700 individuals that researchers can now access on AWS for use in disease research.
The 1000 Genomes Project aims to include the genomes of more than 2,600 individuals from 26 populations around the world, and the NIH will continue to add the remaining genome samples to the public data set this year.
The NIH is part of the U.S. Department of Health and Human Services, and serves as one of the data coordinators for the 1000 Genomes Project.
“Previously, researchers wanting access to public data sets such as the 1000 Genomes Project had to download them from government data centers to their own systems, or have the data physically shipped to them on discs,” Lisa Brooks, program director, Genetic Variation Program, National Human Genome Research Institute (a part of NIH), said in a statement.
“This process took a long time, and that’s assuming a lab had the bandwidth to download the data and sufficient storage and compute infrastructure to hold and analyze the data once they had it,” Brooks said. “We are happy that the 1000 Genomes Project data are on AWS to give researchers anywhere in the world a simple way to access the data so they can put the data to work in their research.”
Putting the data in the AWS cloud “provides a tremendous opportunity for researchers around the world who want to study large-scale human genetic variation but lack the computer capability to do so,” Richard Durbin, co-director of the 1000 Genomes Project and joint head of human genetics at the Wellcome Trust Sanger Institute in England, said in a statement.
Register or login for access to this item and much more
All Information Management content is archived after seven days.
Community members receive:
- All recent and archived articles
- Conference offers and updates
- A full menu of enewsletter options
- Web seminars, white papers, ebooks
Already have an account? Log In
Don't have an account? Register for Free Unlimited Access