Revision as of 17:10, 10 September 2012 by Vhanda (→Videos)

Nepomuk currently acts as the file indexer for the KDE platform, applications and workspaces. Even though we frequently tout that we are not just a file indexer, we need to index the files properly.

= File indexing solutions =

== Strigi ==

KDE software releases, as of version 4.9, use libstreamanalyzer to index files.

Current problems with Strigi:
* Difficult to contribute to
* No documentation
* Unmaintained
* Does not reuse libraries

Lists the current status of indexing different files.

== Roll our own? ==

= File Formats =

This section lists the different file formats and which of them are supported by each file indexing solution.

== Images ==

* JPEG - use exiv2; Strigi also uses exiv2, but this is currently broken
* PNG - Strigi rolls its own parser; it also detects the application name, color depth and interlace mode
* GIF - there isn't much metadata
* EXIF
* TIFF
* BMP
* SVG - Strigi stores them as plain text

We could just use exiv2 and cover almost everything. The code would also be very simple.

== Videos ==

Strigi uses ffmpeg except for ID3, vorbis and OggS. It also has to seek through the file; it is not clear what that is for.

Overall, we could just use ffmpeg for everything. It's very fast and supports pretty much all the formats.
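To make the "use ffmpeg for everything" idea concrete, here is a minimal sketch of what a video extractor could look like. It is an illustration only: it assumes the `ffprobe` command-line tool that ships with FFmpeg is installed, rather than linking against the FFmpeg libraries directly, and the function names (`ffprobe_command`, `summarize`, `index_video`) and the choice of fields are hypothetical, not part of Strigi or Nepomuk.

```python
import json
import subprocess


def ffprobe_command(path):
    """Build an ffprobe call that dumps container and stream info as JSON."""
    return [
        "ffprobe",
        "-v", "quiet",
        "-print_format", "json",
        "-show_format",
        "-show_streams",
        path,
    ]


def _to_float(value):
    """Convert ffprobe's string numbers to float; return None if unusable."""
    try:
        return float(value)
    except (TypeError, ValueError):
        return None


def summarize(probe):
    """Reduce parsed ffprobe JSON to a few fields an indexer might store.

    The selection of fields is an assumption made for this sketch.
    """
    fmt = probe.get("format", {})
    video = [s for s in probe.get("streams", []) if s.get("codec_type") == "video"]
    first = video[0] if video else {}
    return {
        "container": fmt.get("format_name"),
        "duration_seconds": _to_float(fmt.get("duration")),
        "codec": first.get("codec_name"),
        "width": first.get("width"),
        "height": first.get("height"),
        "tags": fmt.get("tags", {}),
    }


def index_video(path):
    """Run ffprobe on `path` and summarize it (needs ffprobe on PATH)."""
    result = subprocess.run(
        ffprobe_command(path), capture_output=True, text=True, check=True
    )
    return summarize(json.loads(result.stdout))
```

Calling `index_video("clip.mkv")` (a hypothetical file name) would return a plain dictionary that an indexer could map onto its own metadata properties. Keeping `summarize` separate from the subprocess call makes the parsing step testable without FFmpeg installed.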
== Audio ==

* MP3

== Documents ==

* doc
* docx
* odf
* pdf
* epub
* mobi
* spreadsheet formats
* presentation formats
* lyx
* tex
* cbz - Comic books

== Archives ==

* tar
* gzip
* whatever ..

== Emails ==

There was a bug report

== Text Files ==

* Text files
* Source Code

== ISO images ==

== Executable files ==

Retrieved from "https://community.kde.org/index.php?title=Projects/Nepomuk/FileIndexing&oldid=23989". Content is available under Creative Commons License SA 4.0 unless otherwise noted.