Parser Configuration

Content Parser Settings

With this settings you can activate or deactivate parsing of additional content-types based on their MIME-types.
For a detailed description of the various MIME-types take a look at http://www.iana.org/assignments/media-types/.
If you want to test a specific parser you can do so using the File Viewer.

Extension Mime-Type
Microsoft Visio Parser
vsd
vst
vss
vdx
vtx
application/visio
application/x-visio
application/vnd.visio
application/visio.drawing
application/vsd
application/x-vsd
image/x-vsd
zz-application/zz-winassoc-vsd
Tape Archive File Parser
tar
application/x-tar
application/tar
applicaton/x-gtar
multipart/x-tar
FreeMind Parser
mm
application/freemind
application/x-freemind
Streaming HTML Parser
htm
phtml
msg
phtm
tex
php4
php5
cfm
stm
php2
xhtml
php3
tpl
txt
aspx
php
html
shtml
shtm
asp
text/html
text/xhtml+xml
application/xhtml+xml
application/x-httpd-php
application/x-tex
application/vnd.ms-outlook
text/plain
text/csv
GNU Zip Compressed Archive Parser
gz
tgz
application/x-gzip
application/gzip
application/x-gunzip
application/gzipped
application/gzip-compressed
gzip/document
Microsoft Powerpoint Parser
pps
ppt
application/mspowerpoint
application/powerpoint
application/vnd.ms-powerpoint
application/ms-powerpoint
application/mspowerpnt
application/vnd-mspowerpoint
application/x-powerpoint
application/x-m
Torrent Metadata Parser
torrent
application/x-bittorrent
Word Document Parser
doc
application/msword
application/doc
appl/text
application/vnd.msword
application/vnd.ms-word
application/winword
application/word
application/x-msw6
application/x-msword
SVG Image Parser
svg
image/svg+xml
ZIP File Parser
zip
jar
apk
application/zip
application/x-zip
application/x-zip-compressed
application/x-compress
application/x-compressed
multipart/x-zip
application/java-archive
application/vnd.android.package-archive
RSS Parser
rss
xml
xml
text/rss
text/xml
application/rss+xml
application/atom+xml
Commodore 64 SID Audio File Parser
sid
audio/prs.sid
audio/psid
audio/x-psid
audio/sidtune
audio/x-sidtune
Microsoft Excel Parser
xla
xls
application/msexcel
application/excel
application/vnd.ms-excel
application/x-excel
application/x-msexcel
application/x-ms-excel
application/x-dos_ms_excel
application/xls
Link Scraper Parser
cpp
jsonp
c
jsp
h
js
json
mf
py
pl
application/json
application/x-javascript
text/javascript
text/x-javascript
text/x-json
text/sgml
Audio File Meta-Tag Parser
mp3
flac
oga
m4p
wma
m4a
ogg
audio/mpeg
audio/MPA
audio/mpa-robust
audio/mp4
audio/flac
audio/x-flac
audio/x-ms-wma
audio/x-ms-asf
Rich Text Format Parser
rtf
text/rtf
text/richtext
application/rtf
application/x-rtf
application/x-soffice
Bzip 2 UNIX Compressed File Parser
tbz2
bz2
tbz
application/x-bzip2
application/bzip2
application/x-bz2
application/x-bzip
application/x-stuffit
Acrobat Portable Document Parser
pdf
application/pdf
application/x-pdf
application/acrobat
applications/vnd.pdf
text/pdf
text/x-pdf
Metadata Image Parser
tif
psd
image/tiff
image/vnd.adobe.photoshop
image/x-photoshop
Comma Separated Value Parser
csv
vCard Parser
vcf
text/x-vcard
application/vcard
application/x-versit
text/x-versit
text/x-vcalendar
Android Application Parser
apk
application/vnd.android.package-archive
Generic Image Parser
tif
jpg
cur
tiff
ico
bmp
gif
png
wbmp
jpeg
rle
jpe
image/jpg
image/cursor
image/vnd.wap.wbmp
image/png
image/jpeg
image/x-cursor
image/bmp
image/gif
image/x-icon
image/x-bmp
image/vnd.microsoft.cursor
image/x-png
image/tiff
image/x-tiff
image/ico
image/vnd.microsoft.icon
PostScript Document Parser
ps
application/postscript
application/ps
application/x-postscript
application/x-ps
application/x-postscript-not-eps
7zip Archive Parser
7z
application/x-7z-compressed
Open Office XML Document Parser
dotx
pptx
xltx
ppsx
xlsx
potx
docx
application/vnd.openxmlformats-officedocument.wordprocessingml.document
application/vnd.openxmlformats-officedocument.wordprocessingml.template
application/vnd.openxmlformats-officedocument.presentationml.template
application/vnd.openxmlformats-officedocument.presentationml.slideshow
application/vnd.openxmlformats-officedocument.presentationml.presentation
application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
application/vnd.openxmlformats-officedocument.spreadsheetml.template
OASIS OpenDocument V2 Text Document Parser
otg
otp
odb
ott
odc
ots
odf
odg
sxw
odi
odm
odp
odt
ods
sxc
application/vnd.oasis.opendocument.text
application/vnd.oasis.opendocument.spreadsheet
application/vnd.oasis.opendocument.presentation
application/vnd.oasis.opendocument.graphics
application/vnd.oasis.opendocument.chart
application/vnd.oasis.opendocument.formula
application/vnd.oasis.opendocument.database
application/vnd.oasis.opendocument.image
application/vnd.oasis.opendocument.text-master
application/vnd.oasis.opendocument.text-template
application/vnd.oasis.opendocument.spreadsheet-template
application/vnd.oasis.opendocument.presentation-template
application/vnd.oasis.opendocument.graphics-template
application/x-vnd.oasis.opendocument.text
application/OOo-calc
application/OOo-writer
PDF Parser Attributes

This is an experimental setting which makes it possible to split PDF documents into individual index entries. Every page will become a single index hit and the url is artifically extended with a post/get attribute value containing the page number as value. When such an url is displayed within a search result, then the post/get attribute is transformed into an anchor hash link. This makes it possible to view the individual page directly in the pdf.js viewer built-in into firefox, for reference see https://github.com/mozilla/pdf.js/wiki/Viewer-options

Split PDF
Property Name