Cognitionis
Hector Llorens Portfolio

Charniak how-to


Charniak is one of the best morfosyntactic parsers for English since 2005.

The distribution is precompiled for 32-bits. For 64bits RECOMPILE: en parser05May26fixed

COMPILATION ————————————————-

If 64-bit: You may need to install lib32 lib32std … rm -rf *.o

Both 32 and 64-bit: Since Ubuntu 9.10 you may also need to manually download and install (dpkg -i) getlibs-all.deb and libstdc++5_3.3.6-17ubuntu1_i386.deb.

make

If you have compilation errors see BchartSm.C (insert #include “GotIter.h”) and AnsStrCounts.C (insert #include “AnswerTree.h”   and   #include “math.h”).

make parseIt

——————————————————————–

REMEMBER: chmod a+x CharniakServer/parser05May26fixed/parseIt

cd charniakserver

vim charniak-server.pl –> update $charniakDir with current path

To execute the server: sudo ./ejecuta_charniak.sh  (parseIt process)

To kill the server: ./kill_charniak.sh

To test it: execute prueba_charniak.sh (or echo “…” | ./charniak-client.pl) or telnet localhost 4449 –> input sentence + enter

Charniak does not split sentences. It needs one sentence per line. (HOWEVER SRL tool splits sentences perfectly)

Works in UTF-8.