Skip to content

Scrape-O-Matic is a semi-artificial intelligence driven scientific information explorer

License

Notifications You must be signed in to change notification settings

hannu40k/scrapeomatic

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

37 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Scrape-O-Matic

Introduction

A software to run Annif on various texts to produce relevant keywords and do a database search base on them. Input can be a text, audio file or a link to a YouTube video. The speech from audio/video is transribed to text. A plugin to scrape texts from websites is also provided. User can also do a recursive search based on obtained article abstracts .

Check provided wide_presentation.pdf for further information.

Currently supported databases

https://finna.fi/

https://doaj.org/

https://arxiv.org/

Dependencies

python >=3.5

Installation

git clone https://github.com/quantum-lizard-brains/scrapeomatic
cd scrapeomatic
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
cp src/wide_app/secrets_template.py src/wide_app/secrets.py
# see secrets.py, and replace any temp values with proper values

Running django dev server

In repository root:

source venv/bin/activate # if not already activated
cd src/
python manage.py runserver

Direct web browser to localhost:8000

About

Scrape-O-Matic is a semi-artificial intelligence driven scientific information explorer

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 42.4%
  • JavaScript 28.4%
  • Vue 17.9%
  • HTML 9.5%
  • CSS 1.8%