The Wayback Machine - https://web.archive.org/web/20200903095130/https://github.com/topics/sequence-clustering
Here are 9 public repositories matching this topic...
MMseqs2: ultra fast and sensitive search and clustering suite
MeShClust: an intelligent tool for clustering DNA sequences
Bioinformatic Tools for analyzing targeted amplicon sequencing developed by Nicholas Hathaway of Bailey Lab
Cluster up to millions of peptide sequences on shared sequence motifs.
MeShClust2: Application of alignment-free identity scores in clustering long DNA sequences
Code to reproduce the experiments and the proposed visualization from 'Data mining in the development of mHealth apps: assessing in-app navigation through Markov Chain analysis'
Updated May 8, 2019 (Jupyter Notebook)
CD-HIT cluster file parser
Updated Apr 11, 2018 (Python)
Dereplicate long sequences