'How to find the length and proportion of nucleotides in a DNA sequence using R
Hi so I have a DNA sequence and need to find the length and nucleotide proportions. This should be easy using table() and length() however the way the file is formatted is making this difficult. R is reading it in not as a single sequence but 1 variable with 20 objects. How do I get around this?
4 558
P_acuticauda GCCATTTCATCCCAGCACAAACCTGGGAGCTCCAGCTGTGTCTGGAAAGCTTCGATGGGATCCCACCAGTCTGCAAAGCTTTTGTGTCAGGTATAGAGAT
P_hecki GCCATTTCATCCCAGCACAAACCTGGGAGCTCCAGCTGTGTCTGGAAAGCTTCGATGGGATCCCACCAGTCTGCAAAGCTTTTGTGTCAGGTATAGAGAT
P_cincta GCCATTTCATCCCAGCACAAAGCTGGGAGCTCCAGCTGTGGCTGGAAAGCTTCGATGGGATCCCACCAGTCTGCAAAGCTTTTGTGTCAGGTATAGAGAT
T_guttata GCCATTTCATCCCAGCACAAAGCTGGGAGCTCCAGCTGTGGCTGAAAAACTTCGATGGGATCCCACCGGTCTGCAAAGCTTTTGTGTCAGGTATAGAGAT
TGATGCATCAGCCTTGGGCTCTCAGAAATGCACCTGGTTTGTAGTGAATGTGAGATGTGTAACTGCTTAGTCAAGGACAAACCTTTTTCCTATTCTCCTC
TGATGCATCAGCCTTGGGCTCTCAGAAATGCACCTGGTTTGTAGTGAATGTGAGATGTGTAACTGCTTAGTCAAGGACAAACCTTTTTCCTATTCTCCTC
TGATGCATCAGCCTTGGGCTCTCAGAAATGCACCTGGTTTGTAGTGAATGTGAGATGTGTAACTGCTTAGTCAAGGACAAACCTTTTTCCTATTCTCCTC
TGATGCATCAGCCCTGGGCTCTCAGAAATGCACCTGGTTTGTAGTGAATGTGAGATGTGTAACTGCTTAGTCAAGGACAAACCTTTTTCCTATTCTCCTC
TCTGTACTGGCTTTTAAAATGTAGTATTTAGTGAAAAAAAACTCTCAATGATGTGTGTGCTACAAACCAGTTGTCCCAGTCAAAGCTATTTCAATAAGCA
TCTGTACTGGCTTTTAAAATGTAGTATTTAGTGAAAAAAAAGTCTCAATGATGTGTGTGCTACAAACCAGTTGTCCCAGTCAAAGTTATTTCAATAAGCA
TCTGTACTGGCTTTTAAAATGTAGTATTTAGTGAAAAAAAACTCTCAATGATGTGTGTGCTACAAACCAGTTGTCCCAGTCAAAGTTATTTCAATAAGCA
TCTATATTGGCTTTTAAAATGCAGTATTTAGTGAAAAAAAAGTCTCAATGATTTGTGTGCTACAAACCAGTTGTCCCAGTCAAAGTTATTTCAATAAGCA
GGAAATAATTTCTGTTATAACTTCAATTTTCAAGAACAGATTTTCTTGACTTGATTGCTTAAAATTGTTTGGGTTTATGTGGTGTTTTTTACCATTTGCA
GGAAATAATTTCTGTTATAACTTCAGTTTTCAAGAACAGATTTTCTTCACTTGATTGCTTAAAATTGTTTGGGTTTATGTGGTGTTTTTTACCATTTGCA
GGAAATAATTTCTGTTATAACTTCAATTTTCAAGAACAGATTTTCTTGACTTGATTGCTTAAAATTGTTTGGGTTTATGTGGTGTTTTTTACCATTTGCA
GGAAATTATTTCTTTTATAACTTCAATTTTCAAGAACAGATTTTCTTGACTTGATTGCTTAAAATTGTTTGGGTTTATGTGGTGTTTTGTACCATTTGCA
CCAGTTTAATGTGTACAGCATTTTTTTTTTTTCTGGATTCTTTCTAGAAACTTAACCTACAAAGAGGAAAGACTTTTTTATTGTCTTGAGGTGTGTTCAG
CCAGTTTAATGTGTACAGCATTTTTTTTTTTTCTGGATTCTTTCTAGAAACTTAACCTACAAAGAGGAAAGACTTTTTTATTGTCTTGAGGTGTGTTCAG
CCAGTTTAATGTGTACAGCATTTTTTTTTTTTCTGGATTCTTTCTAGAAACTTAACCTACAAAGAGGAAAGACTTTTTTATTGTCTTGAGGTGTGTTCAG
CCAGTTTAATGTGTACAGCATTTTTTTTTTTTCTGGATTCTTTCTAGGAACTTAACCTACAAAGAGGAAAGACTTTTTTATTGTCTTGAGGTGTGTTCAG
AAAAAGTAAAATAGATCTTTTGGAAAAAGACCTGTGATCAGATTTTTGCGAAGCAAAA
AAAAAGTAAAATAGATCTTTTGGAAAAAGACCTGTGATCAGATTTTTGCGAAGCAAAA
AAAAAGTAAAATAGATCTTTTGGAAAAAGACCTGTGATCAGATTTTTGCGAAGCAAAA
AAAAAGTAAAATAGATCTTTTGGAAAAAGACCTGTGATCAGATTTTTGCTAAGCAAAA
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
| Solution | Source |
|---|
