Show HN: My 11th grade research project: faster DNA sequence duplicate removal (opens in new tab)

(peerj.com)

2 pointsc0deb0t6y ago1 comments

1 comments

I am a high school student, and this is a published paper that I wrote. If you want to read a shorter blog version of my work, go here: https://blog.liudaniel.com/n-grams-BK-trees.

The general problem is grouping similar DNA/RNA sequences based on something known as a Unique Molecular Identifier, and then collapsing those groups into consensus sequences. This helps estimate the number of unique sequences while efficiently accounting for errors in sequencing or PCR amplification.

If you have any questions, feel free to ask me!

j / k navigate · click thread line to collapse

1 comments

c0deb0tOP6y ago

I am a high school student, and this is a published paper that I wrote. If you want to read a shorter blog version of my work, go here: https://blog.liudaniel.com/n-grams-BK-trees.

If you have any questions, feel free to ask me!

j / k navigate · click thread line to collapse