Python ARPA Package
Python library for reading ARPA n-gram models.
- Documentation is available.
- Changes between releases are documented.
- Bugs can be reported on the issue tracker.
- Questions can be asked via e-mail.
- Source code is tracked on GitHub.
Setup
Python 3.4+
In order to install the Python 3 version:
$ pip install --user -U arpa
Python 2.7
In order to install the Python 2.7 version:
$ pip install --user -U arpa-backport
Usage
The package may be imported directly:
import arpa # Python 3.4+
# OR
import arpa_backport as arpa # Python 2.7
models = arpa.loadf("foo.arpa")
lm = models[0] # ARPA files may contain several models.
# probability p(end|in, the)
lm.p("in the end")
lm.log_p("in the end")
# sentence score w/ sentence markers
lm.s("This is the end .")
lm.log_s("This is the end .")
# sentence score w/o sentence markers
lm.s("This is the end .", sos=False, eos=False)
lm.log_s("This is the end .", sos=False, eos=False)
Development
Contributions are welcome!
Write a bug report or send a pull request.
Other contributors have done so before.
License
Copyright (c) 2015-2018 Stefan Fischer
The source code is available under the MIT License.
See LICENSE for further details.

Formed in 2009, the Archive Team (not to be confused with the archive.org Archive-It Team) is a rogue archivist collective dedicated to saving copies of rapidly dying or deleted websites for the sake of history and digital heritage. The group is 100% composed of volunteers and interested parties, and has expanded into a large amount of related projects for saving online and digital history.
