Go to:

The aim of the Maths Information Retrieval research group at Masaryk University (MIR@MU) is to develop system enabling readers to cope with maths in digital libraries.

Přejít: navigation | start of page | end of page

News

2018-10-24: Petr Sojka and Vítek Novotný present our papers MIaS: Math-Aware Retrieval in Digital Mathematical Libraries and Implementation Notes for the Soft Cosine Measure at the CIKM 2018 ACM conference in Torino, Italy.

Petr Sojka presenting the paper MIaS: Math-Aware Retrieval in Digital Mathematical Libraries at the CIKM 2018 ACM conference Vítek Novotný presenting the paper Implementation Notes for the Soft Cosine Measure at the CIKM 2018 ACM conference

2018-08-07: Our short paper MIaS: Math-Aware Retrieval in Digital Mathematical Libraries (postprint) and Implementation Notes for the Soft Cosine Measure (postprint) were accepted to the CIKM 2018 ACM conference. See you in Torino!

2018-05-23: Vítek Novotný placed third with our paper Semantic Vector Encoding and Similarity Search Using Fulltext Search Engines at the PhD Poster Session (FI MU).

PhD Poster Session (FI MU)

2018-01-29: Michal Růžička has defended his PhD thesis Math Information Retrieval for Digital Libraries.

2017-11-21: Xhulio Kondakçiu has joined our MIR@MU research team.

2017-11-01: Dávid Lupták has joined our MIR@MU research team.

2017-10-21: Michal Růžička presented our paper Flexible Similarity Search of Semantic Vectors Using Fulltext Search Engines at Hybrid Statistical Semantic Understanding and Emerging Semantics (HSSUES) workshop at ISWC 2017.

ISWC HSSEUS 2017: Audiance ISWC HSSEUS 2017: Speakers

2017-08-03: Vítek Novotný presents our paper Semantic Vector Encoding and Similarity Search Using Fulltext Search Engines at 2nd Workshop on Representation Learning for NLP.

Repl4NLP 2017

2017-06-06: Preprint of our paper Semantic Vector Encoding and Similarity Search Using Fulltext Search Engines is available via arXiv.org now.

2017-05-23: Our paper Semantic Vector Encoding and Similarity Search Using Fulltext Search Engines was accepted to the ACL 2017 workshop RepL4NLP 2017. See you in Vancouver!

Preprint will be available soon.

The method is implemented in ScaleText – a software tool for semantic document search under development as part of MIR@MU + RaRe TAČR Omega project TD03000295.

2016-12-02: RASLAN 2016 presentation on the design of ScaleText.

ScaleText is a software tool for semantic document search under development with RaRe Technologies as part of MIR@MU + RaRe TAČR Omega project TD03000295.

All news...

Přejít: navigation | start of page | end of page

Subprojects

Projects

Math Indexer and Searcher (MIaS)

MIaS is a maths-aware full-text based search engine by MIR@MU. MIaS is based on the state-of-the-art system Apache Lucene and accompanied by a web user interface WebMIaS.

Web User Interface for MIaS (WebMIaS)

WebMIaS is a web user interface enabling end users to use MIaS in a user-friendly way.

MathML Normalization (CanonMath)

Advanced MathML search engine working with MathML needs a tool for picking canonical representant of different forms of MathML coding of semantically equivalent formulae. We are developing such canonicalizer (primarily for MIaS).

Mathematical REtrieval Collection (MREC)

Integral part of search engine development is evaluation and testing. Among other data sets we prepared MREC based on arXMLiv — a project of Prof. Dr. Michael Kohlhase's group at Jacobs University Bremen.

Gensim – Similarity of Documents (DocSim)

Gensim is an open-source general-purpose software for scalable topic modelling. We have developed and use this tool for computing similarities between maths documents.

EuDML

MIR@MU team participated on the project of European Digital Mathematical Library (EuDML).

DML-CZ

MIR@MU team participated on the project of Czech Digital Mathematics Library (DML-CZ).

PdfToTextViaOCR

An open-source tool for image-based-PDF to text conversion developed as part of EuDML workflow.

PdfJbIm

An open-source tool for optimization and re-compression of PDF documents using standard JBIG2 compression developed as part of EuDML workflow.

Přejít: navigation | start of page | end of page

About Us

Go to: navigation | start of page | end of page