Skip to content

Latest commit

 

History

History
12 lines (7 loc) · 263 Bytes

to_do.md

File metadata and controls

12 lines (7 loc) · 263 Bytes

To do on existing data

Scraping scripts

BoE

  • There are missing pdf and text must be extracted directly on the web page (notably for some speeches)
  • Extracting text from PDF

ECB

  • Simpler scraping method to implement for next update of the data