Tag Archives: Hadoop
Riding the Elephant
I recently received my first batch of reads from a single paired-end lane run on an [Illumina HiSeq](http://www.illumina.com/systems/hiseq_2000.ilmn) instrument. This batch totaled about 20 billion base pairs of DNA sequence, and the associated data files came to a combined 55.4 gigs of text. …
Posted in bioinformatics, next generation sequencing, software | Tagged Cluster Computing, Hadoop, MapReduce, NGS, Python | 9 Comments