Revision as of 19:01, 26 July 2005
From the package description:
"pstotext extracts text (in the ISO 8859-1 character set) from a PostScript or PDF (Portable Document Format) file. Thus, pstotext is similar to the ps2ascii program that comes with ghostscript. The output of pstotext is however better than that of ps2ascii, because pstotext deals better with punctuation and ligatures."
Also, take a look at the pstotext page: http://research.compaq.com/SRC/virtualpaper/pstotext.html
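As a rough illustration of the description above, the following sketch calls pstotext from Python and decodes its output as ISO 8859-1, the character set named in the package description. It assumes that pstotext is installed on the PATH and that it writes the extracted text to standard output when given a single file argument. The function names and the file name are hypothetical and chosen only for this example.

```python
import shutil
import subprocess


def build_command(path, tool="pstotext"):
    """Return the argument list for a plain `pstotext FILE` call (assumed invocation)."""
    return [tool, path]


def decode_output(raw):
    """Decode raw pstotext output; the package description names ISO 8859-1."""
    return raw.decode("iso-8859-1")


def extract_text(path, tool="pstotext"):
    """Run pstotext on a PostScript or PDF file and return the extracted text."""
    if shutil.which(tool) is None:
        # Assumption: the tool is looked up on PATH under the name "pstotext".
        raise FileNotFoundError(f"{tool} not found on PATH")
    result = subprocess.run(
        build_command(path, tool),
        capture_output=True,  # collect stdout instead of printing it
        check=True,           # raise CalledProcessError on a non-zero exit status
    )
    return decode_output(result.stdout)
```

With pstotext installed, `extract_text("paper.ps")` would return the text of a (hypothetical) file `paper.ps` as a Python string.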