Projects/Nepomuk/FileIndexing
This page attempts to catalogue the list of files formats Nepomuk supports, and what formats are remaining.
Mime Types
MimeType | Status | Plugin | Comments |
---|---|---|---|
image/jpeg | Testing | Exiv2Extractor | No Comments |
image/png | Testing | Exiv2Extractor | - |
image/gif | ? | ? | |
image/exif | |||
image/tiff | |||
image/bmp | |||
image/svg | |||
audio/mpeg | Requires Polish | Taglib Extractor | |
audio/mp4 | |||
audio/wav | |||
audio/x-aiff | |||
application/pdf | Implemented - Requires Testing | PopplerExtractor | --- |
Other Office Formats | ? | ||
Ebook Formats | ? | ||
Archives | ? | ||
video/mpeg | Testing | FFmpeg | |
video/x-msvideo | Testing | FFmpeg | |
Other video formats | ? | ||
text/plain | Plain Text Extractor | Implemented | This should be extended to support other text files |
Notes
Documents
Microsoft Formats
DOC - OLE 2 Compound Document and Office Open XML - Custom parser by Strigi. What can we use? <br\> XSL - http://qt-project.org/wiki/Handling_Microsoft_Excel_file_format <br\> spreadsheet formats <br\>
Maybe we can use some libreoffice or calligra libraries?
Open document formats
ODF - Strigi had their own inbuilt. What are our options?
Ebook formats
- epub - Strigi reuses their ODF parser for epub. We could use libepub
- mobi
- rtf
- lrf
Checkout what Okular uses for all these files and use that.
Other
- lyx
- tex
- cbz - Comic books
Archives
We just need to add the nfo:Archive type based on the mimetype. Is there anything else that we can add?
Emails
- mbox format - How? Something from pim?