D-ORB

The Overrepresented RNA Blocks

M.J. Dupont, F. Major
D-ORB: A Web Server to Extract Structural Features of Related But Unaligned RNA Sequences
J. Mol. Biol., 435 (2023)

Coronavirus 3' UTR pseudoknot   (Rfam: RF00165)


Options
Negatives sequences: Random sequences of similar lengths
ORBs positions sequence: Best consensus structure-matching sequence
Motifs: 162,498 motifs on the alphabet ACGURYN
Rfam
D-ORB
Augmented

(.NNYAN.)
Y(NN)R
N(C(U))
((Y)Y)N

97.8
100.0
100.0
100.0

Y(NN)R
N(C(U))
((Y)Y)N

% of positive sequence with ORB






Deep neural network:

5-fold cross-validation: 99.1 %
(± 0.315 %)

Decision tree:

5-fold cross-validation: 86.9 %
(± 6.55 %)

D-ORB structure
D-ORB structure decision tree:

5-fold cross-validation: 83 %
(± 4.93 %)
of NC_017083.1/30855-30916
NC_017083.1/30855-30916
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
(((.(((.......)..)).))).(.(((((((((((((((..).))))))))))))).).)
.........1.........2.........3.........4.........5.........6..
12345678901234567890123456789012345678901234567890123456789012

abstract shape: ()()

Number of sequences
Structure ORBs (out of 45) (%)
1 (.NNYAN.)
44 97.8
>NC_016992.1/25796-25854
CUACUCUAGCACAGAAUCACAUCUCGAUAAGCAACAGUGCUAGAAGGUUGCUUAUACCA
.................................................(......)..
>NC_028814.1/27384-27446
CUACUCUUAUACAGAAUGGUAGGCUCGUAUAUAAGCUAGUAUAAGUAGAGUUUAUAUAUAUUG
.......(......)................................................
>NC_030886.1/29918-29976
CUACUCUUAUACAGAAUGAGAUCCUAGUGUACUGCAGUAUAAGAAGGCAUUGCACUCGC
........(.....)............................................
>NC_028811.1/27709-27771
CUGGUCUUAUACACAACGGUAGUCCAGUGGUAAUUUCAGUAUAAGAAGGAAAUCACCAUAUUG
..........(.....)..............................................
>NC_009021.1/28893-28951
CUACUCUUAUACAGAAUGGAAUCCUAGUGUACAGUGGUAUAAGUAAGCUGUGCAUUCGC
......(.....)..............................................
>NC_030292.1/28198-28261
CUACUCUUACACAGAAUGGUAAGCACGUAUCUAUGCAGGGUGUAAGUAACUCAUAGAUAUAUUA
......(.....)...................................................
>NC_003436.1/27820-27882
UUGGUCUUGCACACAACGGUAAGCCAGUGGUAAUGUCAGUGCAAGAAGGAUAUUACCAUAGCA
..........(......).............................................
>NC_010437.1/28097-28156
CUACUUAUACUAAAAUGUAAGCCUGUAUUUAAGCAGUAUAAGCAAUACUUAAAUAUAUUA
.......(.....)..............................................
>NC_022103.1/27934-27996
CUAGUCUUGCACACAAUGGUAAGCCAGUAGUAAUGACAGUGCAAGUAGGUUAUUACUAUAUUA
..........(......).............................................
>NC_011550.1/26266-26324
CUACUCUUGCACAGAAUCACAUCUCGAUAAUUGUCAGUGCAAGAAGGACAAUUAUACCA
...............................................(.......)...
>NC_028833.1/27465-27528
UCGGUCUUGUGCACAACGGUAAGCCAGUGGUCAUGUCAGCACAAUGAAGGAUAUGACCAUGUUG
......................................(........)................
>NC_005831.2/27340-27401
UUAGUCUUACACACAAUGGUAGGCCAGUGAUAGUAAAGUGUAAGUAAUUUGCUAUCAUAUUA
..........(.....).............................................
>NC_035191.1/25787-25847
CUCGUCUUAUCCAUAAGAACUAGCCUGUCAUAUAGUAGGAUAAGUAGGCUAUAUGAUUUUA
.....................................(.......)...............
>NC_011547.1/26204-26262
CUACUCUUGCACAGAAUCACCUUUCUAUAAUAACCAGUGCAAGAAGGGUUAUUAUACCA
......................(.....)..............................
>AC_000192.1/31270-31331
ACACUCUCUAUCAGAAUGGAUGUCUUGCUGUCAUAACAGAUAGAGAAGGUUGUGGCAGACCC
....................................................(.....)...
>NC_028752.1/27169-27231
CUAGUCUUAUACACAAUGGUAAGCCAGUAGUAGUAGAGGUAUAAGAAAUUUGCUACUAUGUUA
......(.....)..................................................
>NC_009020.1/30235-30294
CCACUCUUGCACAGAAUGGAAUCAUGUUUUACUUACAGUGCAAGAAGGUAAGUGAACCCA
...................(.........)..............................
>NC_039207.1/29899-29959
CUACUCUUGCACAGAAUGGAAUCAUGUUGUAACUACAGUGCAAGAAGGUAGGUACAAUCCA
...................................................(......)..
>NC_016995.1/26077-26137
CUGGGCUUGCAGCUAACCACUCCAUCAUUAUUAACACUGCAAGAAGGUUAAUAAUGACUCU
........(.......)............................................
>NC_016993.1/26403-26461
CUACUCUUGCACAGAAUCACAUCCUGAUAAUUGUCAGUGCAAGAAGGACAAUUAUACCG
......(.....)..............................................
>NC_016994.1/25811-25872
CUACUCGUGCAGAGAAUCACCCUAUGGCUAACUAACACUGCACGAAGGUUAGUAGCUUCUGA
......(.....).................................................
>NC_025217.1/31231-31289
CUACACUUGUGCUGAAUGGAUUCUAUGUAUAGUGUAGCACAAGAAGACACUAUACACGU
.................................................(.......).
>NC_028824.1/26750-26812
UCAGUCUCAUACACAAUGGUAAGCACGUAAUUAUGCUAGUAUGAGUAGAGUAUAAUUAUAUUG
......(.....)..................................................
>NC_011549.1/26110-26168
CUACUCUUGCACAGAAUCACAUCUCGAUAAGUGUCAGUGCAAGAAGGACACUUAUACCA
.................................................(.....)...
>NC_014470.1/28988-29047
CUACUCUUGUGCAGAAUGAAUUCUCGUAGCUAAACAGCACAAGUAGGUUUAGUUAACUUU
..................................................(......)..
>NC_038861.1/28363-28425
CUACUCUUGUACAGAAUGGUAAGCACGUGUAAUAGGAGGUACAAGCAACCCUAUUGCAUAUUA
......(.....)..................................................
>NC_009019.1/30041-30100
CCACUCUUGCACAGAAUGGAAUCAUGUUAAACUUACAGUGCAAGAAAGUAAGUUAACCCA
......(.....)...............................................
>NC_032107.1/28249-28311
UUAGUCUUACACACAAUGGUAGGCCAGUGGUAGUGAGAGUGUAAGUAGCUUACUAUCAUAUUA
..........(.....)..............................................
>NC_039208.1/25138-25196
CUACUCUAGCACAGAAUCACAUCCCGAUAAUCAACAGUGCUAGAAGGUUGAUUAUACCA
...............................................(.......)...
>NC_004718.3/29460-29519
CUACUCUUGUGCAGAAUGAAUUCUCGUAACUAAACAGCACAAGUAGGUUUAGUUAACUUU
..................................................(......)..
>NC_006577.2/29704-29764
ACACUCUCUAUCAGAAUGAAUUCUUGCUGUAAUAACAGAUAGAGUAGGUUGUUACAGACUA
...................................................(.....)...
>NC_032730.1/28500-28565
ACACUAGUGUACAGAAUCAUUACCACGUCUAUAGUAGGGUACACAUAACUAUCUAUAGAUAUAGAA
........................................(.....)...................
>NC_034972.1/27422-27487
ACACUAGUGUACAGAAUCAUUGCCACGUCUAUAGUAGGGUACACAUAACUAUCUAUAGAUAUAGAA
........................................(.....)...................
>NC_002645.1/27063-27125
CUAGUCUUAUACACAAUGGUAAGCCAGUGGUAGUAAAGGUAUAAGAAAUUUGCUACUAUGUUA
......(.....)..................................................
>NC_016996.1/25944-26002
CUACUCAUGCACAGAAUCACCUUUCAUUAUCUAACAGUGCAUGAAGGUUAGAUAUUCCA
.....................(......)..............................
>NC_009657.1/27966-28028
UUGGUCUUGCACACAACGGUAAGCCUGUAAUAAUGACAGUGCAAGCAGGUUAUUAUUAUAUUG
..........(......).............................................
>NC_009988.1/26936-26998
UUAGUCUCAUACACAAUGGUAAGCACGUAAUUAUGCUAGUAUGAGUAGAGUAUAAUUAUAUUG
........(.....)................................................
>NC_006213.1/30484-30545
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGAAGGUUAUAGCAGACUA
....................................................(.....)...
>NC_017083.1/30855-30916
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
........(.....)...............................................
>NC_018871.1/28279-28341
CUACUCUUAUACAGAAUGGUAAGCCUGUUUAUAAGCUAGUAUAAGUAGAGUUUAUAGAUAUUG
.......(......)................................................
>NC_010438.1/28542-28603
GCAGUCUUAUACACUAUGGUAAGCCUGUAAUUAAAUAGUAUAAGCAAUGUUUAAUUAUAUUA
...........(.....)............................................
>NC_016991.1/25754-25812
CUGCUCAUGCACAGAACCAACUACCGAUAACUAACAGUGCAUGAAGGUUAGUUAUUCCA
....................................(.......)..............
>NC_045512.2/29603-29662
CUACUCUUGUGCAGAAUGAAUUCUCGUAACUACAUAGCACAAGUAGAUGUAGUUAACUUU
....................(........)..............................
>NC_038294.1/29868-29928
CCACUCUUGCACAGAAUGGAAUCAUGUUGUAAUUACAGUGCAAUAAGGUAAUUAUAACCCA
......(.....)................................................
2 (.NNYAN.)
Y(NN)R
37 82.2
>NC_016993.1/26403-26461
CUACUCUUGCACAGAAUCACAUCCUGAUAAUUGUCAGUGCAAGAAGGACAAUUAUACCG
......(.....)..........................(..)................
>NC_010438.1/28542-28603
GCAGUCUUAUACACUAUGGUAAGCCUGUAAUUAAAUAGUAUAAGCAAUGUUUAAUUAUAUUA
...........(.....)......................(..)..................
>NC_009020.1/30235-30294
CCACUCUUGCACAGAAUGGAAUCAUGUUUUACUUACAGUGCAAGAAGGUAAGUGAACCCA
...................(.........)......................(..)....
>NC_017083.1/30855-30916
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
........(.....).........................(..)..................
>NC_010437.1/28097-28156
CUACUUAUACUAAAAUGUAAGCCUGUAUUUAAGCAGUAUAAGCAAUACUUAAAUAUAUUA
.......(.....)........................(..)..................
>NC_030886.1/29918-29976
CUACUCUUAUACAGAAUGAGAUCCUAGUGUACUGCAGUAUAAGAAGGCAUUGCACUCGC
........(.....)........................(..)................
>NC_009657.1/27966-28028
UUGGUCUUGCACACAACGGUAAGCCUGUAAUAAUGACAGUGCAAGCAGGUUAUUAUUAUAUUG
..........(......).......................(..)..................
>NC_038861.1/28363-28425
CUACUCUUGUACAGAAUGGUAAGCACGUGUAAUAGGAGGUACAAGCAACCCUAUUGCAUAUUA
......(.....)..........................(..)....................
>NC_016996.1/25944-26002
CUACUCAUGCACAGAAUCACCUUUCAUUAUCUAACAGUGCAUGAAGGUUAGAUAUUCCA
.....................(......)...................(..).......
>NC_035191.1/25787-25847
CUCGUCUUAUCCAUAAGAACUAGCCUGUCAUAUAGUAGGAUAAGUAGGCUAUAUGAUUUUA
.......(......).........................(..).................
>NC_028752.1/27169-27231
CUAGUCUUAUACACAAUGGUAAGCCAGUAGUAGUAGAGGUAUAAGAAAUUUGCUACUAUGUUA
......(.....)............................(..)..................
>NC_011550.1/26266-26324
CUACUCUUGCACAGAAUCACAUCUCGAUAAUUGUCAGUGCAAGAAGGACAAUUAUACCA
......(.....)..........................(..)................
>NC_045512.2/29603-29662
CUACUCUUGUGCAGAAUGAAUUCUCGUAACUACAUAGCACAAGUAGAUGUAGUUAACUUU
....................(........)......................(..)....
>NC_011549.1/26110-26168
CUACUCUUGCACAGAAUCACAUCUCGAUAAGUGUCAGUGCAAGAAGGACACUUAUACCA
..............(......)............................(..).....
>NC_038294.1/29868-29928
CCACUCUUGCACAGAAUGGAAUCAUGUUGUAAUUACAGUGCAAUAAGGUAAUUAUAACCCA
......(.....).........................(..)...................
>NC_032107.1/28249-28311
UUAGUCUUACACACAAUGGUAGGCCAGUGGUAGUGAGAGUGUAAGUAGCUUACUAUCAUAUUA
..........(.....)........................(..)..................
>NC_039208.1/25138-25196
CUACUCUAGCACAGAAUCACAUCCCGAUAAUCAACAGUGCUAGAAGGUUGAUUAUACCA
......(.....)..........................(..)................
>NC_003436.1/27820-27882
UUGGUCUUGCACACAACGGUAAGCCAGUGGUAAUGUCAGUGCAAGAAGGAUAUUACCAUAGCA
..........(......).......................(..)..................
>NC_028811.1/27709-27771
CUGGUCUUAUACACAACGGUAGUCCAGUGGUAAUUUCAGUAUAAGAAGGAAAUCACCAUAUUG
..........(.....)........................(..)..................
>NC_016991.1/25754-25812
CUGCUCAUGCACAGAACCAACUACCGAUAACUAACAGUGCAUGAAGGUUAGUUAUUCCA
......(.....)........................(..)..................
>NC_022103.1/27934-27996
CUAGUCUUGCACACAAUGGUAAGCCAGUAGUAAUGACAGUGCAAGUAGGUUAUUACUAUAUUA
..........(......).......................(..)..................
>NC_016994.1/25811-25872
CUACUCGUGCAGAGAAUCACCCUAUGGCUAACUAACACUGCACGAAGGUUAGUAGCUUCUGA
......(.....)...........................(..)..................
>NC_006213.1/30484-30545
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGAAGGUUAUAGCAGACUA
........(.....).........................(..)..................
>NC_011547.1/26204-26262
CUACUCUUGCACAGAAUCACCUUUCUAUAAUAACCAGUGCAAGAAGGGUUAUUAUACCA
......................(.....).......................(..)...
>NC_016992.1/25796-25854
CUACUCUAGCACAGAAUCACAUCUCGAUAAGCAACAGUGCUAGAAGGUUGCUUAUACCA
......(.....)..........................(..)................
>NC_002645.1/27063-27125
CUAGUCUUAUACACAAUGGUAAGCCAGUGGUAGUAAAGGUAUAAGAAAUUUGCUACUAUGUUA
......(.....)............................(..)..................
>NC_039207.1/29899-29959
CUACUCUUGCACAGAAUGGAAUCAUGUUGUAACUACAGUGCAAGAAGGUAGGUACAAUCCA
......(.....)...........................(..).................
>NC_006577.2/29704-29764
ACACUCUCUAUCAGAAUGAAUUCUUGCUGUAAUAACAGAUAGAGUAGGUUGUUACAGACUA
........(.....)........................(..)..................
>NC_009988.1/26936-26998
UUAGUCUCAUACACAAUGGUAAGCACGUAAUUAUGCUAGUAUGAGUAGAGUAUAAUUAUAUUG
........(.....)..........................(..)..................
>NC_028814.1/27384-27446
CUACUCUUAUACAGAAUGGUAGGCUCGUAUAUAAGCUAGUAUAAGUAGAGUUUAUAUAUAUUG
.......(......)........................(..)....................
>NC_030292.1/28198-28261
CUACUCUUACACAGAAUGGUAAGCACGUAUCUAUGCAGGGUGUAAGUAACUCAUAGAUAUAUUA
......(.....).............................(..)..................
>NC_009019.1/30041-30100
CCACUCUUGCACAGAAUGGAAUCAUGUUAAACUUACAGUGCAAGAAAGUAAGUUAACCCA
......(.....).........................(..)..................
>NC_018871.1/28279-28341
CUACUCUUAUACAGAAUGGUAAGCCUGUUUAUAAGCUAGUAUAAGUAGAGUUUAUAGAUAUUG
.......(......)........................(..)....................
>NC_028824.1/26750-26812
UCAGUCUCAUACACAAUGGUAAGCACGUAAUUAUGCUAGUAUGAGUAGAGUAUAAUUAUAUUG
......(.....)......................(..)........................
>NC_016995.1/26077-26137
CUGGGCUUGCAGCUAACCACUCCAUCAUUAUUAACACUGCAAGAAGGUUAAUAAUGACUCU
........(.......)......................(..)..................
>NC_009021.1/28893-28951
CUACUCUUAUACAGAAUGGAAUCCUAGUGUACAGUGGUAUAAGUAAGCUGUGCAUUCGC
......(.....)........................(..)..................
>NC_005831.2/27340-27401
UUAGUCUUACACACAAUGGUAGGCCAGUGAUAGUAAAGUGUAAGUAAUUUGCUAUCAUAUUA
..........(.....).......................(..)..................
3 (.NNYAN.)
Y(NN)R
N(C(U))
26 57.8
>NC_038294.1/29868-29928
CCACUCUUGCACAGAAUGGAAUCAUGUUGUAAUUACAGUGCAAUAAGGUAAUUAUAACCCA
..(.(.(.....).....))..................(..)...................
>NC_028752.1/27169-27231
CUAGUCUUAUACACAAUGGUAAGCCAGUAGUAGUAGAGGUAUAAGAAAUUUGCUACUAUGUUA
....(.(...(......)...))..................(..)..................
>NC_030886.1/29918-29976
CUACUCUUAUACAGAAUGAGAUCCUAGUGUACUGCAGUAUAAGAAGGCAUUGCACUCGC
..(.(...(.....)...).)..................(..)................
>NC_009657.1/27966-28028
UUGGUCUUGCACACAACGGUAAGCCUGUAAUAAUGACAGUGCAAGCAGGUUAUUAUUAUAUUG
....(.(...(......)...))..................(..)..................
>NC_016991.1/25754-25812
CUGCUCAUGCACAGAACCAACUACCGAUAACUAACAGUGCAUGAAGGUUAGUUAUUCCA
..(.(.(.....)..).)...................(..)..................
>NC_016992.1/25796-25854
CUACUCUAGCACAGAAUCACAUCUCGAUAAGCAACAGUGCUAGAAGGUUGCUUAUACCA
..(.(.(.....).)).......................(..)................
>NC_022103.1/27934-27996
CUAGUCUUGCACACAAUGGUAAGCCAGUAGUAAUGACAGUGCAAGUAGGUUAUUACUAUAUUA
....(.(...(......)...))..................(..)..................
>NC_032107.1/28249-28311
UUAGUCUUACACACAAUGGUAGGCCAGUGGUAGUGAGAGUGUAAGUAGCUUACUAUCAUAUUA
....(.(...(.....)...).)..................(..)..................
>NC_028811.1/27709-27771
CUGGUCUUAUACACAACGGUAGUCCAGUGGUAAUUUCAGUAUAAGAAGGAAAUCACCAUAUUG
....(.(...(.....)...).)..................(..)..................
>NC_009019.1/30041-30100
CCACUCUUGCACAGAAUGGAAUCAUGUUAAACUUACAGUGCAAGAAAGUAAGUUAACCCA
..(.(.(.....).....))..................(..)..................
>NC_009988.1/26936-26998
UUAGUCUCAUACACAAUGGUAAGCACGUAAUUAUGCUAGUAUGAGUAGAGUAUAAUUAUAUUG
....(.(.(.....)...)..)...................(..)..................
>NC_011547.1/26204-26262
CUACUCUUGCACAGAAUCACCUUUCUAUAAUAACCAGUGCAAGAAGGGUUAUUAUACCA
...................(.((.....)...))..................(..)...
>NC_016995.1/26077-26137
CUGGGCUUGCAGCUAACCACUCCAUCAUUAUUAACACUGCAAGAAGGUUAAUAAUGACUCU
....(.(.(.......)..))..................(..)..................
>NC_003436.1/27820-27882
UUGGUCUUGCACACAACGGUAAGCCAGUGGUAAUGUCAGUGCAAGAAGGAUAUUACCAUAGCA
....(.(...(......)...))..................(..)..................
>NC_017083.1/30855-30916
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
....(.(.(.....)....).)..................(..)..................
>NC_011550.1/26266-26324
CUACUCUUGCACAGAAUCACAUCUCGAUAAUUGUCAGUGCAAGAAGGACAAUUAUACCA
..(.(.(.....).....))...................(..)................
>NC_016993.1/26403-26461
CUACUCUUGCACAGAAUCACAUCCUGAUAAUUGUCAGUGCAAGAAGGACAAUUAUACCG
..(.(.(.....)..).....).................(..)................
>NC_005831.2/27340-27401
UUAGUCUUACACACAAUGGUAGGCCAGUGAUAGUAAAGUGUAAGUAAUUUGCUAUCAUAUUA
....(.(...(.....)...).).................(..)..................
>NC_028814.1/27384-27446
CUACUCUUAUACAGAAUGGUAGGCUCGUAUAUAAGCUAGUAUAAGUAGAGUUUAUAUAUAUUG
..(.(..(......)...))...................(..)....................
>NC_016994.1/25811-25872
CUACUCGUGCAGAGAAUCACCCUAUGGCUAACUAACACUGCACGAAGGUUAGUAGCUUCUGA
..(.(.(.....).....))....................(..)..................
>NC_030292.1/28198-28261
CUACUCUUACACAGAAUGGUAAGCACGUAUCUAUGCAGGGUGUAAGUAACUCAUAGAUAUAUUA
..(.(.(.....).....).).....................(..)..................
>NC_010438.1/28542-28603
GCAGUCUUAUACACUAUGGUAAGCCUGUAAUUAAAUAGUAUAAGCAAUGUUUAAUUAUAUUA
....(.(....(.....)...)).................(..)..................
>NC_038861.1/28363-28425
CUACUCUUGUACAGAAUGGUAAGCACGUGUAAUAGGAGGUACAAGCAACCCUAUUGCAUAUUA
..(.(.(.....)..)....)....................(..)..................
>NC_009021.1/28893-28951
CUACUCUUAUACAGAAUGGAAUCCUAGUGUACAGUGGUAUAAGUAAGCUGUGCAUUCGC
..(.(.(.....)....))..................(..)..................
>NC_010437.1/28097-28156
CUACUUAUACUAAAAUGUAAGCCUGUAUUUAAGCAGUAUAAGCAAUACUUAAAUAUAUUA
..(.(..(.....)...))...................(..)..................
>NC_039207.1/29899-29959
CUACUCUUGCACAGAAUGGAAUCAUGUUGUAACUACAGUGCAAGAAGGUAGGUACAAUCCA
..(.(.(.....)......).)..................(..).................
4 (.NNYAN.)
Y(NN)R
N(C(U))
((Y)Y)N
17 37.8
>NC_030886.1/29918-29976
CUACUCUUAUACAGAAUGAGAUCCUAGUGUACUGCAGUAUAAGAAGGCAUUGCACUCGC
..(.(...(.....)...).)..(.(.............(..)............).).
>NC_011550.1/26266-26324
CUACUCUUGCACAGAAUCACAUCUCGAUAAUUGUCAGUGCAAGAAGGACAAUUAUACCA
..(.(.(.....).....))...(.(.............(..).............).)
>NC_009657.1/27966-28028
UUGGUCUUGCACACAACGGUAAGCCUGUAAUAAUGACAGUGCAAGCAGGUUAUUAUUAUAUUG
....(.(...(......)...))..((..............(..)...............).)
>NC_028752.1/27169-27231
CUAGUCUUAUACACAAUGGUAAGCCAGUAGUAGUAGAGGUAUAAGAAAUUUGCUACUAUGUUA
....(.(...(......)...))...((.............(..)...............).)
>NC_005831.2/27340-27401
UUAGUCUUACACACAAUGGUAGGCCAGUGAUAGUAAAGUGUAAGUAAUUUGCUAUCAUAUUA
....(.(...(.....)....))..((.............(..)...............).)
>NC_009021.1/28893-28951
CUACUCUUAUACAGAAUGGAAUCCUAGUGUACAGUGGUAUAAGUAAGCUGUGCAUUCGC
..(.(.(.....)....))...((.............(..)..............).).
>NC_017083.1/30855-30916
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
....(.(.(.....)....).)..(.(.............(..)...............).)
>NC_009988.1/26936-26998
UUAGUCUCAUACACAAUGGUAAGCACGUAAUUAUGCUAGUAUGAGUAGAGUAUAAUUAUAUUG
....(.(.(.....)...)..)...((..............(..)...............).)
>NC_038861.1/28363-28425
CUACUCUUGUACAGAAUGGUAAGCACGUGUAAUAGGAGGUACAAGCAACCCUAUUGCAUAUUA
..(.(.(.....)..)....)..(..(..............(..)...............).)
>NC_038294.1/29868-29928
CCACUCUUGCACAGAAUGGAAUCAUGUUGUAAUUACAGUGCAAUAAGGUAAUUAUAACCCA
..(.(.(.....).....))..(..(............(..)...............).).
>NC_016993.1/26403-26461
CUACUCUUGCACAGAAUCACAUCCUGAUAAUUGUCAGUGCAAGAAGGACAAUUAUACCG
..(.(.(.....)..).....)..((.............(..).............).)
>NC_010437.1/28097-28156
CUACUUAUACUAAAAUGUAAGCCUGUAUUUAAGCAGUAUAAGCAAUACUUAAAUAUAUUA
..(.(..(.....)...))..(.(..............(..)...............).)
>NC_009019.1/30041-30100
CCACUCUUGCACAGAAUGGAAUCAUGUUAAACUUACAGUGCAAGAAAGUAAGUUAACCCA
..(.(.(.....).....))..(.(.............(..)...............).)
>NC_016991.1/25754-25812
CUGCUCAUGCACAGAACCAACUACCGAUAACUAACAGUGCAUGAAGGUUAGUUAUUCCA
..(.(.(.....)..).)..(..(.............(..)...............).)
>NC_028814.1/27384-27446
CUACUCUUAUACAGAAUGGUAGGCUCGUAUAUAAGCUAGUAUAAGUAGAGUUUAUAUAUAUUG
..(.(..(......)...))..(..(.............(..).................).)
>NC_016994.1/25811-25872
CUACUCGUGCAGAGAAUCACCCUAUGGCUAACUAACACUGCACGAAGGUUAGUAGCUUCUGA
..(.(.(.....).....))...(.(..............(..).............).)..
>NC_022103.1/27934-27996
CUAGUCUUGCACACAAUGGUAAGCCAGUAGUAAUGACAGUGCAAGUAGGUUAUUACUAUAUUA
....(.(...(......)...))..((..............(..)...............).)

>NC_028814.1/27384-27446
CUACUCUUAUACAGAAUGGUAGGCUCGUAUAUAAGCUAGUAUAAGUAGAGUUUAUAUAUAUUG
.......(......)................................................
.......(......)........................(..)....................
..(.(..(......)...))...................(..)....................
..(.(..(......)...))..(..(.............(..).................).)
>NC_030886.1/29918-29976
CUACUCUUAUACAGAAUGAGAUCCUAGUGUACUGCAGUAUAAGAAGGCAUUGCACUCGC
........(.....)............................................
........(.....)........................(..)................
..(.(...(.....)...).)..................(..)................
..(.(...(.....)...).)..(.(.............(..)............).).
>NC_009021.1/28893-28951
CUACUCUUAUACAGAAUGGAAUCCUAGUGUACAGUGGUAUAAGUAAGCUGUGCAUUCGC
......(.....)..............................................
......(.....)........................(..)..................
..(.(.(.....)....))..................(..)..................
..(.(.(.....)....))...((.............(..)..............).).
>NC_010437.1/28097-28156
CUACUUAUACUAAAAUGUAAGCCUGUAUUUAAGCAGUAUAAGCAAUACUUAAAUAUAUUA
.......(.....)..............................................
.......(.....)........................(..)..................
..(.(..(.....)...))...................(..)..................
..(.(..(.....)...))..(.(..............(..)...............).)
>NC_022103.1/27934-27996
CUAGUCUUGCACACAAUGGUAAGCCAGUAGUAAUGACAGUGCAAGUAGGUUAUUACUAUAUUA
..........(......).............................................
..........(......).......................(..)..................
....(.(...(......)...))..................(..)..................
....(.(...(......)...))..((..............(..)...............).)
>NC_011550.1/26266-26324
CUACUCUUGCACAGAAUCACAUCUCGAUAAUUGUCAGUGCAAGAAGGACAAUUAUACCA
...............................................(.......)...
......(.....)..........................(..)................
..(.(.(.....).....))...................(..)................
..(.(.(.....).....))...(.(.............(..).............).)
>NC_005831.2/27340-27401
UUAGUCUUACACACAAUGGUAGGCCAGUGAUAGUAAAGUGUAAGUAAUUUGCUAUCAUAUUA
..........(.....).............................................
..........(.....).......................(..)..................
....(.(...(.....)...).).................(..)..................
....(.(...(.....)....))..((.............(..)...............).)
>NC_028752.1/27169-27231
CUAGUCUUAUACACAAUGGUAAGCCAGUAGUAGUAGAGGUAUAAGAAAUUUGCUACUAUGUUA
......(.....)..................................................
......(.....)............................(..)..................
....(.(...(......)...))..................(..)..................
....(.(...(......)...))...((.............(..)...............).)
>NC_016993.1/26403-26461
CUACUCUUGCACAGAAUCACAUCCUGAUAAUUGUCAGUGCAAGAAGGACAAUUAUACCG
......(.....)..............................................
......(.....)..........................(..)................
..(.(.(.....)..).....).................(..)................
..(.(.(.....)..).....)..((.............(..).............).)
>NC_016994.1/25811-25872
CUACUCGUGCAGAGAAUCACCCUAUGGCUAACUAACACUGCACGAAGGUUAGUAGCUUCUGA
......(.....).................................................
......(.....)...........................(..)..................
..(.(.(.....).....))....................(..)..................
..(.(.(.....).....))...(.(..............(..).............).)..
>NC_038861.1/28363-28425
CUACUCUUGUACAGAAUGGUAAGCACGUGUAAUAGGAGGUACAAGCAACCCUAUUGCAUAUUA
......(.....)..................................................
......(.....)..........................(..)....................
..(.(.(.....)..)....)....................(..)..................
..(.(.(.....)..)....)..(..(..............(..)...............).)
>NC_009019.1/30041-30100
CCACUCUUGCACAGAAUGGAAUCAUGUUAAACUUACAGUGCAAGAAAGUAAGUUAACCCA
......(.....)...............................................
......(.....).........................(..)..................
..(.(.(.....).....))..................(..)..................
..(.(.(.....).....))..(.(.............(..)...............).)
>NC_009657.1/27966-28028
UUGGUCUUGCACACAACGGUAAGCCUGUAAUAAUGACAGUGCAAGCAGGUUAUUAUUAUAUUG
..........(......).............................................
..........(......).......................(..)..................
....(.(...(......)...))..................(..)..................
....(.(...(......)...))..((..............(..)...............).)
>NC_009988.1/26936-26998
UUAGUCUCAUACACAAUGGUAAGCACGUAAUUAUGCUAGUAUGAGUAGAGUAUAAUUAUAUUG
........(.....)................................................
........(.....)..........................(..)..................
....(.(.(.....)...)..)...................(..)..................
....(.(.(.....)...)..)...((..............(..)...............).)
>NC_017083.1/30855-30916
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
........(.....)...............................................
........(.....).........................(..)..................
....(.(.(.....)....).)..................(..)..................
....(.(.(.....)....).)..(.(.............(..)...............).)
>NC_016991.1/25754-25812
CUGCUCAUGCACAGAACCAACUACCGAUAACUAACAGUGCAUGAAGGUUAGUUAUUCCA
....................................(.......)..............
......(.....)........................(..)..................
..(.(.(.....)..).)...................(..)..................
..(.(.(.....)..).)..(..(.............(..)...............).)
>NC_038294.1/29868-29928
CCACUCUUGCACAGAAUGGAAUCAUGUUGUAAUUACAGUGCAAUAAGGUAAUUAUAACCCA
......(.....)................................................
......(.....).........................(..)...................
..(.(.(.....).....))..................(..)...................
..(.(.(.....).....))..(..(............(..)...............).).
>NC_016992.1/25796-25854
CUACUCUAGCACAGAAUCACAUCUCGAUAAGCAACAGUGCUAGAAGGUUGCUUAUACCA
.................................................(......)..
......(.....)..........................(..)................
..(.(.(.....).)).......................(..)................
>NC_028811.1/27709-27771
CUGGUCUUAUACACAACGGUAGUCCAGUGGUAAUUUCAGUAUAAGAAGGAAAUCACCAUAUUG
..........(.....)..............................................
..........(.....)........................(..)..................
....(.(...(.....)...).)..................(..)..................
>NC_030292.1/28198-28261
CUACUCUUACACAGAAUGGUAAGCACGUAUCUAUGCAGGGUGUAAGUAACUCAUAGAUAUAUUA
......(.....)...................................................
......(.....).............................(..)..................
..(.(.(.....).....).).....................(..)..................
>NC_003436.1/27820-27882
UUGGUCUUGCACACAACGGUAAGCCAGUGGUAAUGUCAGUGCAAGAAGGAUAUUACCAUAGCA
..........(......).............................................
..........(......).......................(..)..................
....(.(...(......)...))..................(..)..................
>NC_011547.1/26204-26262
CUACUCUUGCACAGAAUCACCUUUCUAUAAUAACCAGUGCAAGAAGGGUUAUUAUACCA
......................(.....)..............................
......................(.....).......................(..)...
...................(.((.....)...))..................(..)...
>NC_039207.1/29899-29959
CUACUCUUGCACAGAAUGGAAUCAUGUUGUAACUACAGUGCAAGAAGGUAGGUACAAUCCA
...................................................(......)..
......(.....)...........................(..).................
..(.(.(.....)......).)..................(..).................
>NC_016995.1/26077-26137
CUGGGCUUGCAGCUAACCACUCCAUCAUUAUUAACACUGCAAGAAGGUUAAUAAUGACUCU
........(.......)............................................
........(.......)......................(..)..................
....(.(.(.......)..))..................(..)..................
>NC_032107.1/28249-28311
UUAGUCUUACACACAAUGGUAGGCCAGUGGUAGUGAGAGUGUAAGUAGCUUACUAUCAUAUUA
..........(.....)..............................................
..........(.....)........................(..)..................
....(.(...(.....)...).)..................(..)..................
>NC_010438.1/28542-28603
GCAGUCUUAUACACUAUGGUAAGCCUGUAAUUAAAUAGUAUAAGCAAUGUUUAAUUAUAUUA
...........(.....)............................................
...........(.....)......................(..)..................
....(.(....(.....)...)).................(..)..................
>NC_035191.1/25787-25847
CUCGUCUUAUCCAUAAGAACUAGCCUGUCAUAUAGUAGGAUAAGUAGGCUAUAUGAUUUUA
.....................................(.......)...............
.......(......).........................(..).................
>NC_009020.1/30235-30294
CCACUCUUGCACAGAAUGGAAUCAUGUUUUACUUACAGUGCAAGAAGGUAAGUGAACCCA
...................(.........)..............................
...................(.........)......................(..)....
>NC_028824.1/26750-26812
UCAGUCUCAUACACAAUGGUAAGCACGUAAUUAUGCUAGUAUGAGUAGAGUAUAAUUAUAUUG
......(.....)..................................................
......(.....)......................(..)........................
>NC_011549.1/26110-26168
CUACUCUUGCACAGAAUCACAUCUCGAUAAGUGUCAGUGCAAGAAGGACACUUAUACCA
.................................................(.....)...
..............(......)............................(..).....
>NC_039208.1/25138-25196
CUACUCUAGCACAGAAUCACAUCCCGAUAAUCAACAGUGCUAGAAGGUUGAUUAUACCA
...............................................(.......)...
......(.....)..........................(..)................
>NC_006577.2/29704-29764
ACACUCUCUAUCAGAAUGAAUUCUUGCUGUAAUAACAGAUAGAGUAGGUUGUUACAGACUA
...................................................(.....)...
........(.....)........................(..)..................
>NC_002645.1/27063-27125
CUAGUCUUAUACACAAUGGUAAGCCAGUGGUAGUAAAGGUAUAAGAAAUUUGCUACUAUGUUA
......(.....)..................................................
......(.....)............................(..)..................
>NC_016996.1/25944-26002
CUACUCAUGCACAGAAUCACCUUUCAUUAUCUAACAGUGCAUGAAGGUUAGAUAUUCCA
.....................(......)..............................
.....................(......)...................(..).......
>NC_006213.1/30484-30545
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGAAGGUUAUAGCAGACUA
....................................................(.....)...
........(.....).........................(..)..................
>NC_018871.1/28279-28341
CUACUCUUAUACAGAAUGGUAAGCCUGUUUAUAAGCUAGUAUAAGUAGAGUUUAUAGAUAUUG
.......(......)................................................
.......(......)........................(..)....................
>NC_045512.2/29603-29662
CUACUCUUGUGCAGAAUGAAUUCUCGUAACUACAUAGCACAAGUAGAUGUAGUUAACUUU
....................(........)..............................
....................(........)......................(..)....
>NC_028833.1/27465-27528
UCGGUCUUGUGCACAACGGUAAGCCAGUGGUCAUGUCAGCACAAUGAAGGAUAUGACCAUGUUG
......................................(........)................
>AC_000192.1/31270-31331
ACACUCUCUAUCAGAAUGGAUGUCUUGCUGUCAUAACAGAUAGAGAAGGUUGUGGCAGACCC
....................................................(.....)...
>NC_025217.1/31231-31289
CUACACUUGUGCUGAAUGGAUUCUAUGUAUAGUGUAGCACAAGAAGACACUAUACACGU
.................................................(.......).
>NC_014470.1/28988-29047
CUACUCUUGUGCAGAAUGAAUUCUCGUAGCUAAACAGCACAAGUAGGUUUAGUUAACUUU
..................................................(......)..
>NC_004718.3/29460-29519
CUACUCUUGUGCAGAAUGAAUUCUCGUAACUAAACAGCACAAGUAGGUUUAGUUAACUUU
..................................................(......)..
>NC_032730.1/28500-28565
ACACUAGUGUACAGAAUCAUUACCACGUCUAUAGUAGGGUACACAUAACUAUCUAUAGAUAUAGAA
........................................(.....)...................
>NC_034972.1/27422-27487
ACACUAGUGUACAGAAUCAUUGCCACGUCUAUAGUAGGGUACACAUAACUAUCUAUAGAUAUAGAA
........................................(.....)...................

ORB p-value ORB frequency
(per structure per nucleotide)
Means
ratio
1 (.NNYAN.) < 0.001 4.03
NC_017083.1/30855-30916
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
.........1.........2.........3.........4.........5.........6..
12345678901234567890123456789012345678901234567890123456789012
2 Y(NN)R < 0.001 2.66
NC_017083.1/30855-30916
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
.........1.........2.........3.........4.........5.........6..
12345678901234567890123456789012345678901234567890123456789012
3 ((Y)Y)N < 0.001 1.72
NC_017083.1/30855-30916
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
.........1.........2.........3.........4.........5.........6..
12345678901234567890123456789012345678901234567890123456789012
4 N(C(U)) < 0.001 1.66
NC_017083.1/30855-30916
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
.........1.........2.........3.........4.........5.........6..
12345678901234567890123456789012345678901234567890123456789012
ORB p-value ORB frequency
(per structure per nucleotide)
Means
ratio

NCM p-value NCM frequency
(per structure per nucleotide)
Means
ratio
1 U(RYR)C < 0.001 37.4
2 A(C(U)) < 0.001 5.46
NC_017083.1/30855-30916
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
.........1.........2.........3.........4.........5.........6..
12345678901234567890123456789012345678901234567890123456789012
3 (YNRR) < 0.001 3.25
NC_017083.1/30855-30916
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
GCACUCUCUAUCAGAAUGGAUGUCUUGCUGCUAUAAUAGAUAGAGUAGGUUAUAGCAGACCA
.........1.........2.........3.........4.........5.........6..
12345678901234567890123456789012345678901234567890123456789012
NCM p-value NCM frequency
(per structure per nucleotide)
Means
ratio

Structure is visualized using R2R 1.0.6 (Zasha Weinberg).
LOGOS are generated with ggseqlogo (Omar Wagih).