Skip to content

dengpenn/Math-Formula-Retrieval

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Math Formula Retrieval

This repo contains front-end and backend of math formula retrieval system including text and formula index building and searching.

Project Structure

  • Math-search-front includes front-end of our system using Angular JS.
  • Lucene was used for building index and ranking documents(TF-IDF Model).
  • Math formula(Latex format) was parsed as XML and the tree sub-structure was built on it, detailed information can be found in formula-server.

Motivation

  • Mathematical formulae are important means for dissemination and communication of scientific information.
  • Non-alphabetical symbols that are not understood by current search systems.
  • Terms are either meaningless or improperly read and processed by current systems; e.g.
  • Current search systems are not equipped to recognize those structure when searching.

Method

  • We treat every formula inside documents as single indexed file.
  • The ranking algorithms is . . It indicated the matched level between the indexed formula and searched formula.

Demo

Searched Formula

image-20190427214756275

Matched Top3 Formula

Rank1

image-20190427214432442

Rank2

image-20190427214619160

Rank3

image-20190427214644396

Contributor

Deng Pan

Yuchen Deng

Sichao Xue

Reference

Gao, Liangcai, et al. "The Math Retrieval System of ICST for NTCIR-12 MathIR Task." NTCIR. 2016.

About

Math Formula Retrieval System

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •