The Digital Revolution has created a knowledge-based society reliant upon a high-tech global economy. The pace of innovation has been exponential, leaving some to wonder what possibilities the future may hold.
Big Data is the term given for collections of data sets that are too large and complex for traditional hands-on data management and processing. The term comes from the realm of information technology, but across an increasing number of fields, scientists are encountering situations that fit the category of Big Data. Astronomy, genetics, and proteomics are a few of the fields beginning to feel the pressure for managing their data effectively.
There are numerous technical challenges going into setting up a system to process Big Data in reasonable amounts of time. Machine learning algorithms present great potential in their ability to tease out hidden relationships among data sets and make predictions, but these analyses require distributed computing clusters capable of communicating intermediate results between tasks.
by Greg Emmerich, UW Madison, M.S. Biotechnology Program, Early Drug Discovery Class. December 7th, 2012.
Predictive, preventative and personalized medicine is increasingly becoming a reality with recent advances to whole genome sequencing. Genome wide association studies wield impressive amounts of data, but due to differences in populations tested and analysis methodology, there are very few such studies that are reproducible. Great strides are being made in academia and industry, and the more research and data is shared, the more robust future studies will be. Knome Inc. has a unique method for annotating and interpreting genomic data that aims to tackle some of the challenges discussed. Continue reading