Next-Gen Sequence Analysis Workshop (2016)¶
This is the schedule for the 2016 MSU NGS course. All course communications will be organized around the Slack channel.
This workshop has a Workshop Code of Conduct, do read it!
Download all of these materials or visit the GitHub repository.
Meal Times: Breakfast 7:45-8:45, Lunch 12-1, Dinner 6-7 (Unless noted below). See HERE for the menu and other info.
Day | Schedule |
---|---|
Monday 8/8 |
|
Tuesday 8/9 |
|
Wed 8/10 |
|
Thursday 8/11 |
|
Friday 8/12 |
|
Saturday 8/13 |
|
Sunday 8/14 |
|
Monday 8/15 |
|
Tuesday 8/16 |
|
Wed 8/17 |
|
Thursday 8/18 |
|
Friday 8/19 |
|
Dramatis personae¶
Instructors:
- Meg Staton @HardwoodGenomic.
- Matt MacManes @macmanes.
- Torsten Seemann @torstenseemann.
- Shaun Jackman @sjackman.
- Rob Patro @nomad421.
- Ian Dworkin @IanDworkin.
- Sonal Singhal
- Amanda Charbonneau @procrastinomics.
TAs:
- Lisa Cohen @monsterbashseq.
- Will Pitchers @steeljawpanda.
- Ming Chen
She Who Drives Many Places:
- Kate MacManes
Genome Assembly References¶
Papers and References¶
Books¶
Practical Computing for Biologists
These books, especially the 1st, are highly recommended book for people looking for a systematic presentation on shell scripting, programming, UNIX, etc.
RNAseq¶
Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks, Trapnell et al., Nat. Protocols.
One paper that outlines a pipeline with the tophat, cufflinks, cuffdiffs and some associated R scripts.
Statistical design and analysis of RNA sequencing data., Auer and Doerge, Genetics, 2010.
A comprehensive comparison of RNA-Seq-based transcriptome analysis from reads to differential gene expression and cross-comparison with microarrays: a case study in Saccharomyces cerevisiae. Nookaew et al., Nucleic Acids Res. 2012.
Challenges and strategies in transcriptome assembly and differential gene expression quantification. A comprehensive in silico assessment of RNA-seq experiments Vijay et al., 2012.
Computational methods for transcriptome annotation and quantification using RNA-seq, Garber et al., Nat. Methods, 2011.
Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments., Bullard et al., 2010.
A comparison of methods for differential expression analysis of RNA-seq data, Soneson and Delorenzi, BMC Bioinformatics, 2013.
Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples., Wagner et al., Theory Biosci, 2012. Also see this blog post explaining the paper in detail.
Computing and Data¶
- A Quick Guide to Organizing Computational Biology Projects, Noble, PLoS Comp Biology, 2009.
- Willingness to Share Research Data Is Related to the Strength of the Evidence and the Quality of Reporting of Statistical Results, Wicherts et al., PLoS One, 2011.
- Got replicability?, McCullough, Economics in Practice, 2007.
Also see this great pair of blog posts on organizing projects and research workflow.
Links¶
Resources¶
-
A high quality question & answer Web site.
-
A discussion and information site for next-generation sequencing.
-
A large number of open and reusable tutorials on the shell, programming, version control, etc.
Blogs¶
http://www.genomesunzipped.org/
Genomes Unzipped.
-
Titus’s blog.
-
Blue Collar Bioinformatics
-
Mass Genomics
-
Next Genetics
http://gettinggeneticsdone.blogspot.com/
Getting Genetics Done
http://omicsomics.blogspot.com/
Omics! Omics!
-
Nick Loman’s lab notebook
http://TheGenomeFactory.blogspot.com/
The Genome Factory (Torsten Seemann)
Material from previous years¶
- Day 1 - Getting started with Amazon
- Getting started with Amazon EC2
- Details!
- Start up an EC2 instance
- Log in
- Select your zone
- Select the machine operating system to boot
- Choose the machine size
- Change the size of the hard drive
- Confirm and launch
- (First time through) generate a new key pair
- (Next times through) select an existing key pair
- Click on View Instances
- Select the public DNS name for later use
- Logging into your new instance “in the cloud” (Windows version)
- Logging into your new instance “in the cloud” (Mac version)
- Terminating your instance
- Amazon Web Services reference material
- Start up an EC2 instance
- Details!
- Running command-line BLAST
- Getting started with Amazon EC2
- Day 2 – Intro to Linux & Quality control
- Variant calling
- Interval Analysis and Visualization
- Running bedtools
- Understanding the SAM format
- Control Flow and loops in R
- Control Flow
- ifelse()
- Other vectorized ways of control flow.
- Simple loops
- for loop
- So for the for loop we would do the following:
- More avoiding loops
- The step above creates a vector of n NA’s. They will be replaced sequentially with the random numbers as we generate them (using a function like the above one).
- Variant calling and exploration of polymorphisms
- A complete de novo assembly and annotation protocol for mRNASeq
- Assembly with SOAPdenovo-Trans
- Mapping and Counting
- Analyzing RNA-seq counts with DESeq
- RNA-seq: mapping to a reference genome with tophat and counting with HT-seq
- RNA-seq: mapping to a reference genome with BWA and counting with HTSeq
- Booting an Amazon AMI
- Updating the operating system
- Install software
- Preparing the reference
- Mapping
- Genome comparison and phylogeny
- Interactive visual genome comparison with Mauve
- Running a genome alignment
- Booting an Amazon AMI
- Logging in & updating the operating system
- Packages to install
- Getting the E. coli genome data
- What is the nearest reference genome?
- Ordering the assembly contigs against a nearby reference
- Making a phylogeny of many E. coli assemblies
- From tree file to figures
- Automation, scripts, git, and GitHub
- MG-RAST and its API
- So you want to get some sequencing data out of NCBI?
- Looking at k-mer abundance distributions
- PacBio Tutorial
- RNASeq Transcript Mapping and Counting (BWA and HtSeq Flavor)
- Evaluating the quality of your short reads, and trimming them
- Getting started with Amazon EC2
- Details!
- Start up an EC2 instance
- Log in
- Select your zone
- Select the machine operating system to boot
- Choose the machine size
- Change the size of the hard drive
- Confirm and launch
- (First time through) generate a new key pair
- (Next times through) select an existing key pair
- Click on View Instances
- Select the public DNS name for later use
- Logging into your new instance “in the cloud” (Windows version)
- Logging into your new instance “in the cloud” (Mac version)
- Terminating your instance
- Amazon Web Services reference material
- Start up an EC2 instance
- Details!
- Instructor’s Guide to ANGUS Materials
- Workshop Code of Conduct
LICENSE: This documentation and all textual/graphic site content is licensed under the Creative Commons - 0 License (CC0) -- fork @ github. Presentations (PPT/PDF) and PDFs are the property of their respective owners and are under the terms indicated within the presentation.
comments powered by Disqus