This page attempts to catalogue the list of files formats Nepomuk supports, and what formats are remaining.
|audio/mpeg||Requires Polish||Taglib Extractor|
|application/pdf||Implemented - Requires Testing||PopplerExtractor||---|
|Other Office Formats||?|
|Other video formats||?|
|text/plain||Plain Text Extractor||Implemented||This should be extended to support other text files|
DOC - OLE 2 Compound Document and Office Open XML - Custom parser by Strigi. What can we use? <br\> XSL - http://qt-project.org/wiki/Handling_Microsoft_Excel_file_format <br\> spreadsheet formats <br\>
Maybe we can use some libreoffice or calligra libraries?
Open document formats
ODF - Strigi had their own inbuilt. What are our options?
- epub - Strigi reuses their ODF parser for epub. We could use libepub
Checkout what Okular uses for all these files and use that.
- cbz - Comic books
We just need to add the nfo:Archive type based on the mimetype. Is there anything else that we can add?
- mbox format - How? Something from pim?