Bioinformatics course_F6

F6 wiki

F6_Bioinformatics.png

  • Intro:
    ls
    grep
    uniq

BOOK: shell scripting


cd
..
options
ls -l -t order as time edited.
ls -lt the same


https://fantom6-collaboration.gsc.riken.jp/files/mongoDB/

ls -lt | head first 10 line
bunzip2 RunAll_sample_summary.20170830.tsv.bz2 解压bz2

less -S RunAll_sample_summary.20170830.tsv
less RunAll_sample_summary.20170830.tsv
bash-3.2$ wc RunAll_sample_summary.20170830.tsv 7191 212813 1857590 RunAll_sample_summary.20170830.tsv
lines words letters


are sample ID unique or not?
cut -f1 RunAll_sample_summary.20170830.tsv | head
>out put
unique

关于cut的参数: http://man.linuxde.net/cut


SAM/BAM介绍
mapping
formate for reads

Fast: Fasta + quality

bam is the same as SAM,but zip

Each line of SAM start with@

  • QNAME: the name of the read
  • FLAG: what happened with the reads
  • RNAME
  • POS
  • CIGAR: geometry
    drunken sailor?

Bedtools

bed formate

chr start end | option (name score strand (+/-/.))|

start <end

mysql

set theory for genomics

bedtools intersect --help
bedtools intersect -u -a XX.bed -b XXX.bed | head  #to see the unique
#-wa
#-v

is this overlap significant?

bedrolls fisher -m -a XXX.bed -b XXX.bed -g hg38.genome   
#m: merge everything
## two tail p-value

bedtools shuffle

bedtools shuffle -i hg38_gwas.bed
# only one file as input
bedtools shuffle -i hg38_gwas.bed -g hg38.genome  | bedtools sort> hg38_gwas_shuffle.bed

Formate conversion
Coverage Plots


R shiny
R markdown
knitR
R a platform for releasing result
R call variable “objects”

  • vectors 向量
    c()
    paste0()

  • Lists: a second cornerstone class of R
    可以把任何变量加在一个list里面。

  • dataframe

df=data.frame(
a=c()
b=c()
)
summary(df)
#列结合
  • subsetting elements of objects
    by coordinate
    or by name

help(“[“)

  • “nothing”
    NA: missing values, not available
    NULL: nothing
    NaN: 0/0 # result is not a number
sum(1,2,NA, na.rm=T)
3
sum(1, NULL)
1
  • in terminal-
R
barplot(c(1,2,3))
q() # get out of R

first step in R with R studio

on-line http://try.jupyter.org/ (chose Welcome R-demo)

如何在R里搞一些随机数:
http://blog.csdn.net/lilanfeng1991/article/details/18505723

hist(c(1,2,4))
hist(runif(100))
hist(rnorm(100))

GitHub:
https://www.r-bloggers.com/rstudio-and-github/

最后编辑于
©著作权归作者所有,转载或内容合作请联系作者
平台声明:文章内容(如有图片或视频亦包括在内)由作者上传并发布,文章内容仅代表作者本人观点,简书系信息发布平台,仅提供信息存储服务。

推荐阅读更多精彩内容

  • Introduction What is Bowtie 2? Bowtie 2 is an ultrafast a...
    wzz阅读 5,997评论 0 5
  • 终于筋疲力竭的考试完,接着你的生命如此多情,文末作者说,浪漫主义的时代已经远去,有些无奈有些忧伤,而我说,有我,浪...
    苏以婉阅读 500评论 0 0
  • 【执子之手】儿童学习力六期 践行记录20170522Day7 1.中午继续听高效能事务管理法,听到朵拉币的部分,很...
    cancan妈阅读 159评论 0 0
  • 令狐冲是《笑傲江湖》的主人公,很多读者都比较喜欢这个人物。不过,也有人觉得令狐冲很讨厌,比如岳不群、左冷禅,都很讨...
    张舟子阅读 291评论 0 1
  • 这样,约伯年纪老迈,日子满足而死(约伯记 42:17 和合本) 《约伯记》读完,感觉一切的争论都下了定论:神来了,...
    渔夫简想阅读 703评论 0 1