pattern.shell
pattern.en parser
You can run the parser from the command-line. The pattern module should be installed (i.e., located in /site-packages, see installation instructions), or the current working directory should be the one that contains the pattern folder.
> python -m pattern.en.parser -f lewiscarroll.txt
If no options are given a full parse is executed (i.e. tokenization, tagging, chunking, relations and lemmata). Otherwise, you need to explicitly list every required option:
-O | --tokenize | Tokenize the input. |
-T | --tags | Parse part-of-speech tags. |
-C | --chunks | Parse chunks and PNP tags. |
-R | --relations | Find verb/predicate relations. |
-L | --lemmata | Find word lemmata. |
-I | --light | Disable contextual rules. |
-f | --file | Input filename. |
-s | --string | Input string. |
-e | --encoding | Specify character encoding (utf-8 by default). |
-v | --version | Current version of pattern. |
Short options can be concatenated: python -m pattern.en.parser -OTLI -f lewiscarroll.txt
pattern.nl parser
The Dutch parser works in the same way. Note the xml option (which produces XML output):
> python -m pattern.nl.parser xml -s "Het wordt steeds vreemder."
pattern.de parser
The German parser works in the same way.
> python -m pattern.de.parser xml -s "Ein Abenteuer im Wunderland!"