EuDML project
The goal of the EuDML project is to build and operate a European Digital Mathematics Library. The project was co-funded by the EC. It started on 1 February 2010 and officially finished, successfully, on 31 January 2013.
The MIR group at Masaryk University participated in almost all work packages (WPs) of the project. We concentrated on the following work packages:
- WP7 “Metadata enhancer toolset implementation” (recompressing PDFs, ...),
- WP5 “Metadata repository and search implementation” (Indexing and Search – MIaS/WebMIaS),
- WP8 “Association analyser implementation” (similarity – gensim),
- WP9 “Annotation component implementation” (VB, web interface implementation) and
- WP10 “Accessibility component toolset implementation” (Braille MathML drivers via UMCL).
Demos
Live demonstrations of selected technologies are available.
Relevant projects
As part of the EuDML workflow, the MIR@MU team has developed several tools:
Math Indexer and Searcher (MIaS) – a maths-aware full-text search engine.
Optimization of PDF documents – an open-source tool for optimizing and recompressing PDF documents using standard JBIG2 compression.
PdfToTextViaOCR – an open-source tool for converting image-based PDFs to text.
Gensim – open-source software for scalable topic modelling, used to find similar documents.
Cite as
Text
WOJCIECHOWSKI, Krzys, Petr SOJKA, Nicolas HOUILLON, Michal RŮŽIČKA, Radim HATLAPATKA, Vlastimil KREJČÍŘ, Miroslav HRDINA, Jiří SOCHOR, Pavel RYCHLÝ, Aleš HORÁK, Alan SEXTON, Gilberto PEDROSA, Franck LONTIN, Thierry BOUCHE and Maciej KOŁUDA. Toolset for Image and Text Processing and Metadata Enhancements — Final Release: Deliverable 7.4 of project EuDML. As of 9th February 2013. EU CIP-ICT-PSP project 250503 EuDML: The European Digital Mathematics Library, 2013. 24 pp. Deliverable D7.4.
BibTeX
@misc{eudml:d7.4,
  author = "Krzys Wojciechowski and Petr Sojka and Nicolas Houillon and Michal Růžička and Radim Hatlapatka and Vlastimil Krejčíř and Miroslav Hrdina and Jiří Sochor and Pavel Rychlý and Aleš Horák and Alan Sexton and Gilberto Pedrosa and Franck Lontin and Thierry Bouche and Maciej Kołuda",
  title = "{Toolset for Image and Text Processing and Metadata Enhancements -- Final Release}",
  year = 2013,
  month = feb,
  note = {Deliverable D7.4 of EU CIP-ICT-PSP project 250503 \href{http://project.eudml.eu/}{EuDML: The European Digital Mathematics Library}},
  url = {https://project.eudml.org/sites/default/files/D7.4.pdf},
}
Selected presentations
Michal Růžička: [Meta]data acquisition and validation
Radim Hatlapatka: PDF re-compression using JBIG2
Zuzana Nevěřilová: Metadata Processing
Miha Filej: DML editor I18n
Radim Hatlapatka: PDF Enhancements Tools
Michal Růžička, Petr Kovář: Metadata Editor
Martin Líška: Mathematical Indexing and Querying
Zuzana Nevěřilová: Visual Browser 4 Math – use cases
Selected Publications
The full list of our EuDML-related publications can be found on the project page on our university site.
- LEE, Mark, Petr SOJKA, Radim ŘEHŮŘEK, Radim HATLAPATKA, Maroš KUCBEL, Thierry BOUCHE, Claude GOUTORBE, Romeo ANGHELACHE and Krzysztof WOJCIECHOWSKI. Toolset for Entity and Semantic Associations – Final Release: Deliverable 8.4 of project EuDML. 1.0 as of 8th February 2013. EU CIP-ICT-PSP project 250503 EuDML: The European Digital Mathematics Library, 2013. 13 pp. Deliverable D8.4.
- ANGHELACHE, Romeo, Petr SOJKA, Vlastislav DOHNAL, Radoslav PAVLOV, Georgi SIMEONOV, Michael JOST, Klaus KIERMEIER, Lucia SANTAMARIA LARA, Helena MIHALJEVIC-BRANDT, Aleksandar PEROVIC, Olaf TESCHKE, Aleksander NOWINSKI, Jean-Luc ARCHIMBAUD, Krzysztof WOJCIECHOWSKI, Brigitte BIDEGARAY-FESQUET, Yves LAURENT, Thierry BOUCHE and Julien PUYDT. EuDML Assessment and Evaluation – Final Report: Deliverable 11.4 of project EuDML. 2.22 as of 12th April 2013. EU CIP-ICT-PSP project 250503 EuDML: The European Digital Mathematics Library, 2013. 34 pp. Deliverable D11.4.
- WOJCIECHOWSKI, Krzys, Petr SOJKA, Nicolas HOUILLON, Michal RŮŽIČKA, Radim HATLAPATKA, Vlastimil KREJČÍŘ, Miroslav HRDINA, Jiří SOCHOR, Pavel RYCHLÝ, Aleš HORÁK, Alan SEXTON, Gilberto PEDROSA, Franck LONTIN, Thierry BOUCHE and Maciej KOŁUDA. Toolset for Image and Text Processing and Metadata Enhancements – Final Release: Deliverable 7.4 of project EuDML. 1 as of 9th February 2013. EU CIP-ICT-PSP project 250503 EuDML: The European Digital Mathematics Library, 2013. 24 pp. Deliverable D7.4.
- BOUCHE, Thierry, Claude GOUTORBE, Nicolas HOUILLON, Jean-Paul JORDA, Romeo ANGHELACHE, Vittorio COTI ZELATI, Vlastimil KREJČÍŘ and Petr SOJKA. Final report on external imported metadata: Deliverable 3.5 of project EuDML. 1.1 as of 30th January 2013. EU CIP-ICT-PSP project 250503 EuDML: The European Digital Mathematics Library, 2013. 23 pp. Deliverable D3.5.
- HATLAPATKA, Radim. JBIG2 Supported by OCR. In Petr Sojka, Michael Kohlhase. DML 2012: Towards a Digital Mathematics Library. Brno: Masaryk University, 2012.
- BOUCHE, Thierry, Hugo MANGUINHAS, Claude GOUTORBE, Nicolas HOUILLON, Rosa DE LA VIESCA, Helena FERNANDEZ, Jean-Paul JORDA, Marie-Louise CHAIX, Katarzyna ZAMLYNSKA, Wojtek SYLWESTRZAK, Michael JOST, Radoslav PAVLOV, Jiří RÁKOSNÍK, Miroslav BARTOŠEK, Petr SOJKA, Ioannis KARYDIS and Thomas FISCHER. Report on available collections and metadata: Deliverable 3.1 of project EuDML. 1.6 as of 5th August 2010. EU CIP-ICT-PSP project 250503 EuDML: The European Digital Mathematics Library, 2010. 40 pp. Deliverable D3.1.
- WOJCIECHOWSKI, Krzyś, Aleksander NOWIŃSKI, Petr SOJKA and Martin LÍŠKA. The EuDML Search and Browse Service - Final: Deliverable 5.3 of project EuDML. 1.2 as of 11th February 2013. EU: EU CIP-ICT-PSP project 250503 EuDML: The European Digital Mathematics Library, 2013. 16 pp. Deliverable D5.3.
- JOST, Michael, Thierry BOUCHE, Claude GOUTORBE, Jean-Paul JORDA, Miroslav BARTOŠEK, Peter STANCHEV and Michał POLITOWSKI. The EuDML metadata schema: Deliverable D3.2 of project EuDML. 1.6 as of 15th December 2010. EU CIP-ICT-PSP project 250503 EuDML: The European Digital Mathematics Library, 2010. 31 pp. Deliverable D3.2.
- LEE, Mark, Petr SOJKA and Radim ŘEHŮŘEK. Toolset for Entity and Semantic Associations – Value Release: Deliverable 8.3 of project EuDML. 1.0 as of 31st May 2012. EU CIP-ICT-PSP project 250503 EuDML: The European Digital Mathematics Library, 2012. 12 pp. Deliverable D8.3.
- LEE, Mark, Petr SOJKA, Volker SORGE, Josef BAKER, Wojtek HURY and Łukasz BOLIKOWSKI. Association Analyzer Implementation: State of the Art: Deliverable 8.1 of project EuDML. 1 as of 27th November 2010. EU CIP-ICT-PSP project 250503 EuDML: The European Digital Mathematics Library, 2010. 22 pp. Deliverable D8.1.
- SOJKA, Petr, Krzysztof WOJCIECHOWSKI, Nicolas HOUILLON, Michal RŮŽIČKA and Radim HATLAPATKA. Toolset for Image and Text Processing and Metadata Enhancements – Value release: Deliverable 7.3 of project EuDML. 1.01 as of 8th March 2012. EU CIP-ICT-PSP project 250503 EuDML: The European Digital Mathematics Library, 2012. 32 pp. Deliverable D7.3.
- SOJKA, Petr and Radim HATLAPATKA. Toolset for Image and Text Processing and Metadata Editing – Initial release: Deliverable 7.2 of project EuDML. 1.0 as of 1st March 2011. EU CIP-ICT-PSP project 250503 EuDML: The European Digital Mathematics Library, 2011. 25 pp. Deliverable D7.2.
- SOJKA, Petr, Josef BAKER, Alan SEXTON and Volker SORGE. State of the Art of Augmenting Metadata Techniques and Technology: Deliverable 7.1 of project EuDML. 1.2 as of 2nd November 2010. EU CIP-ICT-PSP project 250503 EuDML: The European Digital Mathematics Library, 2010. 40 pp. Deliverable D7.1.
- SOJKA, Petr. Workshop report: Deliverable D2.2 of EuDML project. 1.0 as of 11th January 2012. EU CIP-ICT-PSP project 250503 EuDML: The European Digital Mathematics Library, 2012. 24 pp. Deliverable D2.2.
- SORGE, Volker, Mark LEE, Petr SOJKA and Alan P. SEXTON. State of the Art of Accessibility Tools: Deliverable D10.1 of project EuDML. 1.0 as of 28th February 2011. EU CIP-ICT-PSP project 250503 EuDML: The European Digital Mathematics Library, 2011. 25 pp. Deliverable D10.1.
- HATLAPATKA, Radim. JBIG2 Supported by OCR. CEUR Workshop Proceedings, Aachen: publisher not stated, 2012, vol. 921, October, pp. 82-90. ISSN 1613-0073.
- BOUCHE, Thierry, Alexander NOWINSKI, Petr SOJKA and Volker SORGE. Le projet EuDML : état des lieux et premiers résultats (invited talk 1.3.2012, Colloquium UJF, Grenoble, FR). In Le colloquium a généralement lieu, Institut Fourier, Grenoble. 2012.
- LEE, Mark, Petr SOJKA, Radim ŘEHŮŘEK, Łukasz BOLIKOWSKI, Wojtek HURY and Volker SORGE. Toolset for Entity and Semantic Associations – Initial Release: Deliverable 8.2 of project EuDML. 1.0 as of 27th May 2011. EU CIP-ICT-PSP project 250503 EuDML: The European Digital Mathematics Library, 2011. 12 pp. Deliverable D8.2.
- ŘEHŮŘEK, Radim and Petr SOJKA. Gensim -- Statistical Semantics in Python. In EuroScipy 2011, Paris. 2011.
- SOJKA, Petr and Martin LÍŠKA. The Art of Mathematics Retrieval. In Matthew R. B. Hardy, Frank Wm. Tompa. Proceedings of the 2011 ACM Symposium on Document Engineering. Mountain View, CA, USA: ACM, 2011. pp. 57--60, 4 pp. ISBN 978-1-4503-0863-2. doi:10.1145/2034691.2034703.
- NEVĚŘILOVÁ, Zuzana. Metadata Visualization in Digital Libraries. In Stefan Gradmann, Francesca Borri, Carlo Meghini and Heiko Schuldt. Research and Advanced Technology for Digital Libraries: International Conference on Theory and Practice of Digital Libraries, TPDL 2011. Berlin, Germany: Springer Verlag, Berlin-Heidelberg, 2011. pp. 442-445, 4 pp. ISBN 978-3-642-24468-1.
- BORBINHA, José, Thierry BOUCHE, Aleksander NOWIŃSKI and Petr SOJKA. Project EuDML--A First Year Demonstration. In James H. Davenport, William M. Farmer, Josef Urban, Florian Rabe. Intelligent Computer Mathematics. Lecture Notes in Computer Science, 2011, Volume 6824/2011. Berlin / Heidelberg: Springer, 2011. pp. 281--284, 3 pp. ISBN 978-3-642-22672-4.
- SOJKA, Petr and Martin LÍŠKA. Indexing and Searching Mathematics in Digital Libraries -- Architecture, Design and Scalability Issues. In James H. Davenport, William M. Farmer, Josef Urban, Florian Rabe. Intelligent Computer Mathematics. Lecture Notes in Computer Science, 2011, Volume 6824/2011. Berlin / Heidelberg: Springer, 2011. pp. 228--243, 15 pp. ISBN 978-3-642-22672-4. doi:10.1007/978-3-642-22673-1_16.
- SOJKA, Petr and Thierry BOUCHE. Proceedings of DML 2011 Towards a Digital Mathematics Library. Brno, Czech Republic: Masaryk University Press, 2011. 118 pp. ISBN 978-80-210-5542-1.
- SOJKA, Petr and Thierry BOUCHE. DML 2011 workshop. 2011.
- HATLAPATKA, Radim and Petr SOJKA. Recompression of Bitmaps in PDF using JBIG2 format. 2010.
- SOJKA, Petr. From Bitmaps back to Brains: DML-CZ and EuDML Projects (invited talk 8.11.2010, University of Birmingham AI Seminar, UK). In University of Birmingham, School of Computer Science, Artificial Intelligence and Natural Computation. 2010.
- SYLWESTRZAK, Wojtek, José BORBINHA, Thierry BOUCHE, Aleksander NOWIŃSKI and Petr SOJKA. EuDML--Towards the European Digital Mathematics Library. In DML 2010 Towards a Digital Mathematics Library. Brno, Czech Republic: Masaryk University, 2010. pp. 11--26, 16 pp. ISBN 978-80-210-5242-0.
- SOJKA, Petr and Radim HATLAPATKA. Document Engineering for a Digital Library: PDF recompression using JBIG2 and other optimization of PDF documents. In Proceedings of MEMICS 2010 conference. Znojmo, Czech Republic: NOVPRESS s.r.o., 2010. p. 205. ISBN 978-80-87342-10-7.
- SOJKA, Petr and Radim HATLAPATKA. Document Engineering for a Digital Library: PDF recompression using JBIG2 and other optimization of PDF documents. In Proceedings of DocEng 2010 conference. Manchester, UK: ACM, 2010. pp. 3-12, 10 pp. ISBN 978-1-4503-0231-9. doi:10.1145/1860559.1860563.
- HATLAPATKA, Radim and Petr SOJKA. PDF Enhancements Tools for a Digital Library: pdfJbIm and pdfsign. In DML 2010 Towards a Digital Mathematics Library. First edition. Brno, Czech Republic: Masaryk University, 2010. pp. 45-55, 11 pp. ISBN 978-80-210-5242-0.
- NEVĚŘILOVÁ, Zuzana. Implementing Dynamic Visualization as an Alternative Interface to a Digital Mathematics Library. In Towards a Digital Mathematics Library, Proceedings. Brno: Masaryk University, 2010. pp. 63-68, 6 pp. ISBN 978-80-210-5242-0.