Fast sequence alignment on Unicode strings
I want to run something like the BLAST algorithm to query a large database of Unicode strings. Most alignment software, like BLAST, expects nucleotide or protein strings as input, but my input could contain any Unicode character. Is there any software that will let me do this? The scoring matrix is just the identity matrix (there are no partial matches).
I have tried Needleman-Wunsch and Smith-Waterman, but they are far too slow for querying a large database.
Thank you!
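For reference, the Smith-Waterman baseline with identity scoring can be sketched as below. This is only an illustration: the function name and the gap and mismatch penalties are assumptions, not values from the question. It shows why the approach is slow, since every query costs time proportional to len(query) * len(target) for each database entry. Note that Python compares strings code point by code point, so strings should be normalized consistently (for example to NFC) before comparison.

```python
def smith_waterman_score(a: str, b: str, match: int = 1,
                         mismatch: int = -1, gap: int = -1) -> int:
    """Best local alignment score of two Unicode strings.

    Identity scoring: characters either match exactly or they do not.
    Runs in O(len(a) * len(b)) time and O(len(b)) memory.
    """
    best = 0
    prev = [0] * (len(b) + 1)  # previous row of the DP table
    for ch_a in a:
        curr = [0]  # column 0 is always 0 for local alignment
        for j, ch_b in enumerate(b, start=1):
            diag = prev[j - 1] + (match if ch_a == ch_b else mismatch)
            score = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            curr.append(score)
            best = max(best, score)
        prev = curr
    return best
```

Running this against every entry of a large database repeats the full quadratic computation each time, which is the cost a seed-based heuristic tries to avoid.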
BLAST can be used to align sequences of letters from any alphabet. The publicly available implementations are geared towards proteins, so you might need to implement it yourself, but the algorithm is not specific to protein or nucleotide sequences.
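As a rough illustration of what implementing it yourself could look like, here is a minimal seed-and-extend sketch in the spirit of BLAST, not a reimplementation of it. It indexes exact k-character seeds, then extends each seed hit without gaps until the score drops too far below its best value. All names (build_kmer_index, extend_seed, search) and the parameters k, x_drop and min_score are hypothetical choices for this example. It omits gapped extension and statistical significance, both of which the real BLAST provides.

```python
from collections import defaultdict


def build_kmer_index(database, k=3):
    """Map each k-character substring to the (sequence id, offset) pairs where it occurs."""
    index = defaultdict(list)
    for seq_id, seq in enumerate(database):
        for pos in range(len(seq) - k + 1):
            index[seq[pos:pos + k]].append((seq_id, pos))
    return index


def extend_seed(query, target, q_pos, t_pos, k, match=1, mismatch=-1, x_drop=3):
    """Extend an exact seed in both directions without gaps (identity scoring).

    Returns (score, q_start, q_end, t_start) of the best extension found.
    """
    # Extend to the right of the seed.
    score = best = k * match
    q_end = q_pos + k
    i, j = q_pos + k, t_pos + k
    while i < len(query) and j < len(target) and best - score <= x_drop:
        score += match if query[i] == target[j] else mismatch
        i += 1
        j += 1
        if score > best:
            best, q_end = score, i
    # Extend to the left, starting from the best right-extended score.
    score = best
    q_start = q_pos
    i, j = q_pos - 1, t_pos - 1
    while i >= 0 and j >= 0 and best - score <= x_drop:
        score += match if query[i] == target[j] else mismatch
        if score > best:
            best, q_start = score, i
        i -= 1
        j -= 1
    return best, q_start, q_end, t_pos - (q_pos - q_start)


def search(query, database, index, k=3, min_score=5):
    """Return hits as (score, seq_id, q_start, q_end, t_start), best first."""
    hits = set()  # seeds on the same diagonal often yield the same extension
    for q_pos in range(len(query) - k + 1):
        for seq_id, t_pos in index.get(query[q_pos:q_pos + k], ()):
            score, q_start, q_end, t_start = extend_seed(
                query, database[seq_id], q_pos, t_pos, k)
            if score >= min_score:
                hits.add((score, seq_id, q_start, q_end, t_start))
    return sorted(hits, reverse=True)


database = ["xxαβγδεyy", "日本語のテキスト", "ζηαβγδεθ"]
index = build_kmer_index(database, k=3)
hits = search("αβγδε", database, index, k=3, min_score=5)
```

The index is built once, so each query only touches database entries that share at least one exact k-character substring with it. A smaller k finds more distant matches at the cost of more seed hits to extend.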