The Wayback Machine - https://web.archive.org/web/20201129114606/https://github.com/eonu/sequentia
Skip to content
master
Go to file
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
lib
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

README.md

Sequentia

A machine learning interface for isolated sequence classification algorithms in Python.

Introduction

Sequential data is one of the most commonly observed forms of data. These can range from time series (sequences of observations occurring through time) to non-temporal sequences such as DNA nucleotides. Time series such as audio signals and stock prices are often of particular interest as changing patterns over time naturally provide many interesting opportunities and challenges for machine learning.

This library specifically aims to tackle classification problems for isolated sequences by creating an interface to a number of classification algorithms.

Despite these types of sequences sounding very specific, you probably observe some of them on a regular basis!

Some examples of classification problems for isolated sequences include classifying:

  • a word utterance by its speech audio signal,
  • a hand-written character according to its pen-tip trajectory,
  • a hand or head gesture in a video or motion-capture recording.

Features

Sequentia offers the use of multivariate observation sequences with varying durations using the following methods:

Classification algorithms

  • Hidden Markov Models (via Pomegranate [1])
    Learning with the Baum-Welch algorithm [2]
    • Multivariate Gaussian emissions
    • Gaussian Mixture Model emissions (full and diagonal covariances)
    • Left-right and ergodic topologies
  • Approximate Dynamic Time Warping k-Nearest Neighbors (implemented with FastDTW [3])
    • Custom distance-weighted predictions
    • Multi-processed predictions


Example of a classification algorithm: a multi-class HMM isolated sequence classifier

Preprocessing methods

  • Centering, standardization and min-max scaling
  • Decimation and mean downsampling
  • Mean and median filtering

Installation

pip install sequentia

Documentation

Documentation for the package is available on Read The Docs.

Tutorials and examples

For tutorials and examples on the usage of Sequentia, look at the notebooks here.

References

[1] Jacob Schreiber. "pomegranate: Fast and Flexible Probabilistic Modeling in Python." Journal of Machine Learning Research 18 (2018), (164):1-6.
[2] Lawrence R. Rabiner. "A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition" Proceedings of the IEEE 77 (1989), no. 2, pp. 257-86.
[3] Stan Salvador, and Philip Chan. "FastDTW: Toward accurate dynamic time warping in linear time and space." Intelligent Data Analysis 11.5 (2007), 561-580.

Contributors

All contributions to this repository are greatly appreciated. Contribution guidelines can be found here.

Edwin Onuonga
Edwin Onuonga

✉️ 🌍

Sequentia © 2019-2021, Edwin Onuonga - Released under the MIT License.
Authored and maintained by Edwin Onuonga.

You can’t perform that action at this time.