why nucleotides is more than 4?
3 views (last 30 days)
Show older comments
hi as I know the no. of nucleotides is 4 letters. why in matlab consider it 17 letters as in table here:
thanks
0 Comments
Accepted Answer
Walter Roberson
on 16 Nov 2011
The table there looks pretty straight-forward to me: http://www.mathworks.com/help/toolbox/bioinfo/ref/int2nt.html#bp_rekb-1 . It has codes for situations in which particular sets of nucleotides are known to be present or known to be absent.
Besides, the number of known nucleotides is not 4: it is currently 8. The 7th and 8th were announced in July 2011, with the 5th and 6th having been announced in April 2005.
More Answers (1)
Lucio Cetto
on 19 Nov 2011
Ambiguous nucleotide symbols are used to characterize sequences that can have variations. It was introduced in the 80's and they are useful nowadays in certain cases, for example describing restriction enzymes. (e.g. http://www.chem.qmul.ac.uk/iubmb/misc/naseq.html). In my personal opinion I think that there are other situations in which we have better options, such as sequence motifs, sequence profiles and the more elaborated profile HMMs. If you plan to convert to aa, Matlab can actually use also ambiguous aa codes when possible, although this is no longer a standard practice; most people now uses only ACGT.
0 Comments
See Also
Categories
Find more on Genomics and Next Generation Sequencing in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!