
[ Download ]
The word splitter is a simple script that reads plain text (one sentence per line) and outputs the words with spaces between every word and punctuation mark (this format is need by tools such as the POS-tagger).
Usage: word-splitter.pl inputFile [ > outputFile]