This package aims to machine-read the portuguese law officially published in dre. Thanks for checking it out.
Allows to tokenize, parse and analyse the raw text from the official source, which is downloaded by its dependency pt_law_downloader.
In particular, this package structures the text according to the structure of the law, identifies references to other documents and to European law, and other perks.
Take a look at the tests to understand how you can use this package.
git clone git+https://github.com/publicos-pt/pt_law_parser.git
pip install -r requirements.txt
mkdir cached_html
python -m unittest discover