forked from YuLab-SMU/ggmsa
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathdata.R
148 lines (120 loc) · 3.1 KB
/
data.R
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
#' A sample data used in ggmsa
#'
#' A dataset containing the alignment sequences of
#' the phenylalanine hydroxylase protein (PH4H)
#' within nine species
#'
#'
#' @docType data
#' @keywords datasets
#' @name sample.fasta
#' @format A MSA fasta with 9 sequences and 456 positions.
NULL
#' GVariation
#'
#' A folder containing 4 MAS files as a sample
#' data set to identify the sequence recombination event.
#'
#' \itemize{
#' \item A.Mont.fas MSA with sequences of 'Mont' and 'CF_YL21'
#' \item B.Oz.fas MSA with sequences of 'Oz' and 'CF_YL21'
#' \item C.Wilga5.fas MSA with sequences of 'Wilga5' and 'CF_YL21'
#' \item sample_alignment.fa MSA with sequences of 'Mont', 'CF_YL21',
#' 'Oz', and 'Wilga5'
#' }
#' @docType data
#' @keywords datasets
#' @name GVariation
#' @format a folder
#' @source \url{https://link.springer.com/article/10.1007/s11540-015-9307-3}
NULL
#' Rfam
#'
#' A folder containing seed alignment sequences and
#' corresponding consensus RNA secondary structure.
#'
#' \itemize{
#' \item RF00458.fasta seed alignment sequences of Cripavirus internal
#' ribosome entry site (IRES)
#' \item RF03120.fasta seed alignment sequences of Sarbecovirus 5'UTR
#' \item RF03120_SS.txt consensus RNA secondary structure of
#' Sarbecovirus 5'UTR
#'
#' }
#' @docType data
#' @keywords datasets
#' @name Rfam
#' @format a folder
#' @source \url{https://rfam.xfam.org/}
NULL
#' Gram-negative_AKL
#'
#' Amino acids in the adenylate kinase lid (AKL) domain
#' from Gram-negative bacteria.
#'
#' @docType data
#' @keywords datasets
#' @name Gram-negative_AKL.fasta
#' @format A MSA fasta with 100 sequences and 36 positions.
#' @source \url{http://biovis.net/year/2013/info/redesign-contest}
NULL
#' Gram-positive_AKL
#'
#' Amino acids in the adenylate kinase lid (AKL) domain
#' from Gram-positive bacteria.
#'
#' @docType data
#' @keywords datasets
#' @name Gram-positive_AKL.fasta
#' @format A MSA fasta with 100 sequences and 36 positions.
#' @source \url{http://biovis.net/year/2013/info/redesign-contest}
NULL
#' A sample DNA alignment sequences
#'
#' DNA alignment sequences with 24 sequences and 56 positions.
#'
#'
#' @docType data
#' @keywords datasets
#' @name LeaderRepeat_All.fa
#' @format A MSA fasta
NULL
#' microRNA data used in ggmsa
#'
#'Fasta format sequences of mature miRNA sequences
#'from miRBase
#'
#'
#' @docType data
#' @keywords datasets
#' @name seedSample.fa
#' @format A MSA fasta with 6 sequences and 22 positions.
#' @source \url{https://www.mirbase.org/ftp.shtml}
NULL
#' sequence-link-tree
#'
#' Alignment sequences used to demonstrate circular MSA layout
#'
#' @docType data
#' @keywords datasets
#' @name sequence-link-tree.fasta
#' @format A MSA fasta with 28 sequences and 480 positions.
NULL
#' TP53 MSA
#'
#' Alignment sequences of used to show graphical combination
#'
#' @docType data
#' @keywords datasets
#' @name tp53.fa
#' @format A MSA fasta with 5 sequences and 404 positions.
NULL
#' genome locus
#'
#' The local genome map shows the 30000 sites around the TP53 gene.
#'
#' @docType data
#' @keywords datasets
#' @name TP53_genes.xlsx
#' @format xlsx
NULL