Professional Documents
Culture Documents
Bioinformatic Methods I
Bioinformatic Methods I
2007
Illumina
Next-Gen
Genome Analyzer
Billion
Million
Thousand
1990
Dideoxy
Sequencing
2000
AB 3730xl
Automated
Sequencer
Bioinformatic Methods I
Trillion
2013
Illumina HiSeq
Genome Analyzer
Bioinformatic Methods I
Platform
Reads
Sanger
454
Illumina
750bp
~14M
$667B
400bp
2,500
$37.5M
2 x 150bp
3.3
$50K
1000
3 Mb
3 Gb
300 x
~ 1 Tb
Read
Length
(bases)
Paired
Ends
Run Time
(days)
Yield
(Gb)
Rate
(days/Gb)
Sanger
96
750
No
0.5
0.00007
~7000 days
1M
400
yes
0.8
~1.25 days
1B
150
yes
11
300
~1 hr
Bioinformatic Methods I
Bioinformatic Methods I
Bioinformatic Methods I
G
A
A
T
C
C
G
T
A
C
C
T
T
G
G
A
A
C
A
C
T
C
T
T
A
G
Bioinformatic Methods I
Bioinformatic Methods I
NGS Assembly
Cut poem into short reads
Each part of poem is read multiple
times
Assemble poem by assembling
overlapping reads
Bioinformatic Methods I
Assembly
And hast thou slain the Jabberwock? Come to my arms, my beamish boy! O frabjous day! Callooh! Callay! He chortled in his joy.
chortled in h
Callooh! Calla
ish boy! O
jous day! Ca
Come to my ar
bberwock? C
s, my beamish
ay! He chortle
Contig
And hast thou slain the Jabberwock? Come to my arms, my beamish boy! O frabjous day! Callooh! Callay! He chortled in his joy.
Bioinformatic Methods I
Assemby Strategy
Cut poem into short reads
Each part of poem is read multiple
times
Assemble poem by assembling
overlapping reads
Problem
Repetitive regions
the Jabberwock
Bioinformatic Methods I
Assembly Challenges
50 - 1000 Epic Poems (for human
microbiome)
Bioinformatic Methods I
Bioinformatic Methods I
Image courtesy of Jessica Yang, M.Sc. Thesis (2011), adapted from Schatz et al. (2010) Genome Res. 20:1165-73.
Bioinformatic Methods I
Bioinformatic Methods I
Bioinformatic Methods I
Bioinformatic Methods I
RNA-Seq
RNA-Seq is a powerful method for quantifying steady-state mRNA expression
levels and detecting alternative splicing events in transcriptomes.
An RNA-Seq pipeline involves creating cDNA, shearing, adding adapters, NGS,
mapping reads, and summarizing the read counts (e.g. FPKM fragments per
kilobase per million fragments mapped). Evaluation of alternative splicing
events may be visualized in a genome browser.
Short reads mapped
to reference genome
in GBrowse instance
(height = # of reads at
a given position)
Bioinformatic Methods I
Convert mRNA
population into
cDNA
Generate short
sequence reads from
cDNA population
Gene 1
Bioinformatic Methods I
Gene 2
N. Provart & D. Guttman Intro for Lab 6 Slide 20
10
Bioinformatic Methods I
Metagenomics
The application of modern genomics techniques to the study of communities of
microbial organisms directly in their natural environments, bypassing the need
for isolation and lab cultivation of individual species
Definition: Chen & Pacter (2005) PLoS CompBio 1:e24. Images courtesy of JGI.
Bioinformatic Methods I
11