Does this mean that when I search for "to be or not to be" everything will also search all those html tokens which may be in the middle of the phrase likeEverything will treat the following file types as plain text files: htm
Code: Select all
to <b>be</b> or <i>not</i> to <b>be</b>
I think html to text parsing can be easy, even with the help of external preprocessors like HTMLAsText from nirsoft.net
Because of various needs and industry conflicts I think that external preprocessors like pdftotext and calling other COM class eg. word, excel or any format showed up in the future will be a must.