
Recognized Filetypes

ParserMonster can read any of the following types of files:

  • Text
  • Rich text
  • Microsoft Excel spreadsheets
  • Microsoft Word documents
  • XML (currently, only as raw text)

To determine which reading method to use, ParserMonster maintains a list of file extensions, each mapped to one of the listed file types. The list is stored in a hidden file named cx-ExtensionMapDefaults. You can create another cx- file if you need to override or extend this list.
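As a rough illustration of the lookup described above, the following Python sketch maps extensions to mapping keywords and lets a second map override or extend the defaults. The on-disk format of cx-ExtensionMapDefaults is not described on this page, so the dictionaries, function names and sample entries here are assumptions made purely for illustration; this is not ParserMonster's actual implementation.

```python
import os

# Hypothetical default map: file extension -> mapping keyword.
DEFAULT_MAP = {
    ".txt": "txt",
    ".rtf": "rtf",
    ".doc": "doc",
    ".xls": "xls",
    ".xml": "xml",
}

# Hypothetical user map: adds one extension and remaps another.
USER_MAP = {
    ".log": "txt",
    ".xml": "txt",
}


def build_extension_map(defaults, overrides):
    """Return a new map in which override entries win over the defaults."""
    merged = dict(defaults)
    merged.update(overrides)
    return merged


def keyword_for(filename, extension_map):
    """Return the mapping keyword for a file name, or '???' if unmapped."""
    ext = os.path.splitext(filename)[1].lower()
    return extension_map.get(ext, "???")


extension_map = build_extension_map(DEFAULT_MAP, USER_MAP)
```

With these sample maps, keyword_for("notes.LOG", extension_map) yields txt, and an extension that appears in neither map yields the unknown keyword ???.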

The only valid mapping keywords are as follows (of these, ParserMonster can read doc, rtf, txt, xls and xml):

  • bin (Binary)
  • doc (Microsoft Word documents)
  • img (Image)
  • rtf (Rich text) [1]
  • txt (Text)
  • xls (Microsoft Excel spreadsheets)
  • xml (XML) [2]
  • zip (Archive)
  • ??? (Unknown) [3]

[1] Some non-standard or complex rich text documents can't be read properly. If possible, convert them to Word documents.

[2] The XML is not interpreted; ParserMonster sees it as raw text.

[3] ParserMonster treats unknown filetypes as plain text. It assumes you are trying to process text. Deliberately choosing non-text files of unknown type may slow down the program! (The exceptions are tasks that only process file names.)
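The keyword handling described in the list and notes above can be sketched in Python as follows. The grouping and the function name are assumptions for illustration only: the page states that xml is seen as raw text and that unknown types are treated as plain text, and this sketch assumes that bin, img and zip files are simply not read.

```python
# Keywords read as plain text: xml is not interpreted [2], unknown is assumed to be text [3].
READ_AS_TEXT = {"txt", "xml", "???"}

# Keywords assumed to have their own reading method.
READ_WITH_OWN_READER = {"rtf", "doc", "xls"}

# Valid keywords that this sketch assumes are not read at all.
NOT_READ = {"bin", "img", "zip"}


def reading_method(keyword):
    """Return the reading method for a mapping keyword, or None if not read."""
    if keyword in READ_AS_TEXT:
        return "text"
    if keyword in READ_WITH_OWN_READER:
        return keyword
    if keyword in NOT_READ:
        return None
    raise ValueError("not a valid mapping keyword: " + repr(keyword))
```

Under these assumptions, reading_method("???") falls back to the text reader, which is why pointing a task at non-text files of unknown type can be slow.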

Default Map

recognized_filetypes.1438145856.txt.gz · Last modified: 2015/07/29 04:57 by Mitch