The content: function defaults to using a system defined iFilter for that specific filetype/extension.
This is basically for extracting the text part of the file.
For example a html file iFilter will roughly extract the text you see in a browser or preview pane, without all the tags etc.
(my assumption; didn't test it)
To bypass this behaviour, use the utf8content: function instead. This will treat the content as (UTF-8) plain text instead.
Depending on the encoding of your file, you mnight need to replace utf8content: with one of the following:
ansi-content:
ascii-content:
text-plain-content:
utf16-content:
utf16le-content:
utf16becontent:
This is basically for extracting the text part of the file.
For example a html file iFilter will roughly extract the text you see in a browser or preview pane, without all the tags etc.
(my assumption; didn't test it)
To bypass this behaviour, use the utf8content: function instead. This will treat the content as (UTF-8) plain text instead.
Depending on the encoding of your file, you mnight need to replace utf8content: with one of the following:
ansi-content:
ascii-content:
text-plain-content:
utf16-content:
utf16le-content:
utf16becontent:
Statistics: Posted by NotNull — Sat May 18, 2024 8:02 pm