python-arpa

Python library for n-gram models in ARPA format

MIT License

Downloads
5.4K
Stars
38
Committers
1

Python ARPA Package

Python library for reading ARPA n-gram models.

Setup

Python 3.4+

In order to install the Python 3 version:

$ pip install --user -U arpa

Python 2.7

In order to install the Python 2.7 version:

$ pip install --user -U arpa-backport

Usage

The package may be imported directly:

import arpa  # Python 3.4+
# OR
import arpa_backport as arpa  # Python 2.7

models = arpa.loadf("foo.arpa")
lm = models[0]  # ARPA files may contain several models.

# probability p(end|in, the)
lm.p("in the end")
lm.log_p("in the end")

# sentence score w/ sentence markers
lm.s("This is the end .")
lm.log_s("This is the end .")

# sentence score w/o sentence markers
lm.s("This is the end .", sos=False, eos=False)
lm.log_s("This is the end .", sos=False, eos=False)

Development

Contributions are welcome! Write a bug report or send a pull request. Other contributors have done so before.

License

Copyright (c) 2015-2018 Stefan Fischer The source code is available under the MIT License. See LICENSE for further details.