Разделы презентаций


Illumina data QC & b asic NGS tools

Содержание

From the very beginning...AACCCGTACGTTTTGCAAACGACCGT...

Слайды и текст этой презентации

Слайд 1Illumina data QC &
basic NGS tools


Illumina data QC & basic NGS tools

Слайд 2From the very beginning




...AACCCGTACGTTTTGCAAACGACCGT...

From the very beginning...AACCCGTACGTTTTGCAAACGACCGT...

Слайд 3From the very beginning
Sequencing




...AACCCGTACGTTTTGCAAACGACCGT...
AACCCGTACGT
CGTACGTTTTG
AACGACCG
GTTTTGCAAACG
GTACGTTTTGCA

From the very beginningSequencing...AACCCGTACGTTTTGCAAACGACCGT...AACCCGTACGTCGTACGTTTTGAACGACCGGTTTTGCAAACGGTACGTTTTGCA

Слайд 4From the very beginning
Sequencing

Coverage




...AACCCGTACGTTTTGCAAACGACCGT...
AACCCGTACGT
CGTACGTTTTG
AACGACCG
GTTTTGCAAACG
GTACGTTTTGCA


3x
2x

From the very beginningSequencingCoverage...AACCCGTACGTTTTGCAAACGACCGT...AACCCGTACGTCGTACGTTTTGAACGACCGGTTTTGCAAACGGTACGTTTTGCA3x2x

Слайд 5From the very beginning
Sequencing

Coverage

Errors
Mismatches





...AACCCGTACGTTTTGCAAACGACCGT...
AACCCGTTCGT
CGTACGTTTTC
AACGACCG
GTTTTGCAAACG
GTACGTTTTGCA

From the very beginningSequencingCoverageErrorsMismatches...AACCCGTACGTTTTGCAAACGACCGT...AACCCGTTCGTCGTACGTTTTCAACGACCGGTTTTGCAAACGGTACGTTTTGCA

Слайд 6From the very beginning
Sequencing

Coverage

Errors
Mismatches
Indels





...AACCCGTACGTTTTGCAAACGACCGT...
AACCCGTTCGT
CGTACGTTTTTC
AACGACCG
GTTTTGCAAACG
GTA_GTTTTGCA

From the very beginningSequencingCoverageErrorsMismatchesIndels...AACCCGTACGTTTTGCAAACGACCGT...AACCCGTTCGTCGTACGTTTTTCAACGACCGGTTTTGCAAACGGTA_GTTTTGCA

Слайд 7Early days
Sanger sequencing
Long reads (~900 bp)
Low coverage (< 10x)
Extreme cost

Human

genome project
3 Gbp
3 billion USD
10 years


Early daysSanger sequencingLong reads (~900 bp)Low coverage (< 10x)Extreme costHuman genome project3 Gbp3 billion USD10 years

Слайд 8NGS
Shorter reads (25-400bp)
High coverage (50-1000x)
Huge amount of data
Low cost
More

applications
Required completely new algorithms

NGSShorter reads (25-400bp)High coverage (50-1000x)Huge amount of data Low costMore applicationsRequired completely new algorithms

Слайд 9NGS technologies

NGS technologies

Слайд 10Illumina sequencing
http://www.youtube.com/watch?v=77r5p8IBwJk

Illumina sequencinghttp://www.youtube.com/watch?v=77r5p8IBwJk

Слайд 11IonTorrent sequencing
https://www.youtube.com/watch?v=WYBzbxIfuKs

IonTorrent sequencinghttps://www.youtube.com/watch?v=WYBzbxIfuKs

Слайд 12Paired reads

AACCCGTACGTTTTGCAAACGACCGTAACCAAATTGG



AACCCGTACGT........TAACCAAATTGG
insert size


Paired-end (< 1 kbp)
Mate-pairs (1 - 20 kbp)

Paired readsAACCCGTACGTTTTGCAAACGACCGTAACCAAATTGGAACCCGTACGT........TAACCAAATTGGinsert sizePaired-end (< 1 kbp)Mate-pairs (1 - 20 kbp)

Слайд 13Insert size distribution
Insert size
# of reads

Insert size distributionInsert size# of reads

Слайд 14FASTA/FASTQ
FASTA
>EAS20_8_6_1_9_1972/1
ACCACCATTACCACCACCATCACCATTACCACAGGTAACGGTGCGGGCTGACGC
>EAS20_8_6_1_163_1521/1
GCAGAAAACGTTCTGCATTTGCCACTGATGTACCGCCGAACTTCAACACTCGCA
FASTQ
@EAS20_8_6_1_1477_92/1
ACCGTTACCTGTGGTAATGGTGATGGTGGTGGTAATGGTGGTGCTAATGCGTTT
+EAS20_8_6_1_1477_92/1
HHGHFHHHHHHHHHGFFHHHBG?GGC8DD9GF??=FFBCGBAF>FGCFHGHGGG
Phred quality
Q = [ - 10 log10 p / (1

- p) ]

FASTA/FASTQFASTA>EAS20_8_6_1_9_1972/1ACCACCATTACCACCACCATCACCATTACCACAGGTAACGGTGCGGGCTGACGC>EAS20_8_6_1_163_1521/1GCAGAAAACGTTCTGCATTTGCCACTGATGTACCGCCGAACTTCAACACTCGCAFASTQ@EAS20_8_6_1_1477_92/1ACCGTTACCTGTGGTAATGGTGATGGTGGTGGTAATGGTGGTGCTAATGCGTTT+EAS20_8_6_1_1477_92/1HHGHFHHHHHHHHHGFFHHHBG?GGC8DD9GF??=FFBCGBAF>FGCFHGHGGGPhred qualityQ = [ - 10 log10 p / (1 - p) ]

Слайд 15seqtk utility
Subsampling sample
Converting between interleaved/paired files mergepe, seq -1/-2
fastq->fasta seq -A
Quality trimming
Shifting the

quality
Modifying names
etc...

seqtk utilitySubsampling sampleConverting between interleaved/paired files mergepe, seq -1/-2fastq->fasta seq -AQuality trimmingShifting the qualityModifying namesetc...

Слайд 16Quality Control

Quality Control

Слайд 17FastQC
Easy and lightweight quality control for sequencing data
Does not require

reference genome

FastQCEasy and lightweight quality control for sequencing dataDoes not require reference genome

Слайд 18
Per base sequence quality

Per base sequence quality

Слайд 19
Per base sequence quality

Per base sequence quality

Слайд 20Per sequence GC content

Per sequence GC content

Слайд 21Per sequence GC content

Per sequence GC content

Слайд 22Per sequence GC content

Per sequence GC content

Слайд 23Per base sequence content

Per base sequence content

Слайд 24Per base sequence content

Per base sequence content

Слайд 25FastQC
fastqc -h
mkdir
fastqc …

-o

FastQCfastqc -hmkdir fastqc  …         -o

Слайд 26Error correction

Error correction

Слайд 27
Per base sequence quality

Per base sequence quality

Слайд 28Trimmomatic
SE LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36

Remove leading

low quality or N bases (below quality 3) (LEADING:3)
Remove trailing

low quality or N bases (below quality 3) (TRAILING:3)

TrimmomaticSE  		LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36Remove leading low quality or N bases (below quality 3) (LEADING:3)Remove trailing

Слайд 29Trimmomatic
Scan the read with a 4-base wide sliding window, cutting

when the average quality per base drops below 15 (SLIDINGWINDOW:4:15)
Drop

reads below the 36 bases long (MINLEN:36)


TrimmomaticScan the read with a 4-base wide sliding window, cutting when the average quality per base drops

Слайд 30Trimmomatic
PE

paired> OPTIONS
ILLUMINACLIP:
ILLUMINACLIP:TruSeq3-PE.fa

TrimmomaticPE    OPTIONS ILLUMINACLIP: ILLUMINACLIP:TruSeq3-PE.fa

Слайд 31Adapter trimming
ILLUMINACLIP::::

ILLUMINACLIP:NexteraPE-PE.fa:2:10:30

Adapter trimmingILLUMINACLIP:::: ILLUMINACLIP:NexteraPE-PE.fa:2:10:30

Слайд 32Thank you!


Thank you!

Обратная связь

Если не удалось найти и скачать доклад-презентацию, Вы можете заказать его на нашем сайте. Мы постараемся найти нужный Вам материал и отправим по электронной почте. Не стесняйтесь обращаться к нам, если у вас возникли вопросы или пожелания:

Email: Нажмите что бы посмотреть 

Что такое TheSlide.ru?

Это сайт презентации, докладов, проектов в PowerPoint. Здесь удобно  хранить и делиться своими презентациями с другими пользователями.


Для правообладателей

Яндекс.Метрика