LanguageTools Unknown Word Lemmatizer:
Installation and Integration

Quick Demo Start

After installing the license, run one of the delivered scripts: 'start-demo.sh' (on Unix-based systems) or 'start-demo.bat' (on Windows systems).

API Documentation

Detailed API information is available in the accompanying Javadoc documentation.

Delivered Documents

  • adhocLemmatizer.jar (archive file containing code)
  • GerNAdHoc-surface.fsa (Compiled surface information)
  • GerNAdHoc-surface.tab (Feature table)
  • GerNAdHoc-rules.tra (Compiled rules information)
  • GerNAdHoc-rules.tab (Feature table)
  • GerNAdHoc-rules.triple (Help table)
  • GerNAdHoc-citsuffix.tra (Suffix transducer)
  • GerNAdHoc-lemmatizer.tra (Compiled lexicalized data)
  • GerNAdHoc-lemmatizer.tab (Feature table)
  • GerNAdHoc-postfilters.xml (Compiled lexicalized data)
  • com/canoo/wmtrans/adhoclemmatizer/IntegrationDemo.java (source file used to test and show the integration in your own code)

Please note: In these file names, the N in the string GerN will be either 1 (if you use the evaluation version of this product) or 3 (if you use the full version).

License Installation

If you are using a licensed version, extract the file "license.cfg" from the received license archive "license.zip" and move it into the same directory as the jar file.
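The extraction step can also be scripted. The following is a minimal sketch, not part of the delivered product: the class name LicenseInstallSketch and the method installLicense are hypothetical, and the code assumes that "license.cfg" sits at the top level of "license.zip".

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.nio.file.StandardCopyOption;
import java.util.zip.ZipEntry;
import java.util.zip.ZipFile;

// Hypothetical helper, not shipped with the lemmatizer.
class LicenseInstallSketch {

    /** Copies license.cfg out of the archive into the directory holding the jar. */
    static Path installLicense(Path licenseZip, Path jarDir) throws IOException {
        try (ZipFile zip = new ZipFile(licenseZip.toFile())) {
            // Assumption: the entry is stored at the archive root.
            ZipEntry entry = zip.getEntry("license.cfg");
            if (entry == null) {
                throw new IOException("license.cfg not found in " + licenseZip);
            }
            Path target = jarDir.resolve("license.cfg");
            try (InputStream in = zip.getInputStream(entry)) {
                Files.copy(in, target, StandardCopyOption.REPLACE_EXISTING);
            }
            return target;
        }
    }

    public static void main(String[] args) throws IOException {
        // Assumes license.zip and adhocLemmatizer.jar are in the current directory.
        Path target = installLicense(Paths.get("license.zip"), Paths.get("."));
        System.out.println("Installed " + target);
    }
}
```

Unpacking the archive by hand with any zip tool gives the same result; the sketch is only useful where the installation is automated.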

Proxy Configuration

If you have a proxy server, configure the Unknown Word Lemmatizer using the delivered ProxyConf program, which will generate the file "proxy.cfg" in the same directory as the jar file. Anonymous and basic authentication protocols are supported.

To call ProxyConf:

   java -cp adhocLemmatizer.jar com.canoo.wmtrans.util.ProxyConf

Alternatively, you can set the following custom system properties:

  • http.proxyHost
  • http.proxyPort
  • http.proxyPlainAuthentication
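System properties like these can be passed on the command line with -D or set programmatically. The following is a minimal sketch of the programmatic route: the class name ProxyPropertiesSketch and the host and port values are hypothetical, and it assumes the properties must be set before the lemmatizer classes are first used. The value format of http.proxyPlainAuthentication is not described in this document, so the sketch leaves it unset.

```java
// Hypothetical helper, not shipped with the lemmatizer.
class ProxyPropertiesSketch {

    /** Sets the proxy host and port as JVM-wide system properties. */
    static void configureProxy(String host, int port) {
        System.setProperty("http.proxyHost", host);
        System.setProperty("http.proxyPort", Integer.toString(port));
        // http.proxyPlainAuthentication is intentionally not set here:
        // its expected value format is not documented in this file.
    }

    public static void main(String[] args) {
        // Placeholder values; replace with your own proxy settings.
        configureProxy("proxy.example.com", 8080);
        System.out.println(System.getProperty("http.proxyHost")
                + ":" + System.getProperty("http.proxyPort"));
    }
}
```

The command-line equivalent is to add -Dhttp.proxyHost=... and -Dhttp.proxyPort=... to the java call.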

Installing and Running Integration Test

To install and run the integration test:

  • Compile the integration test file:
    javac -classpath .:adhocLemmatizer.jar \
          com/canoo/wmtrans/adhoclemmatizer/IntegrationDemo.java

  • Run the integration test program:
    java -cp .:adhocLemmatizer.jar \
          com.canoo.wmtrans.adhoclemmatizer.IntegrationDemo \
          GerNAdHoc-surface.fsa GerNAdHoc-surface.tab \
          GerNAdHoc-rules.tra GerNAdHoc-rules.tab GerNAdHoc-rules.triple \
          GerNAdHoc-citsuffix.tra \
          GerNAdHoc-lemmatizer.tra GerNAdHoc-lemmatizer.tab \
          GerNAdHoc-postfilters.xml

    (Please note that Unix/Linux and Windows use different classpath separators:
    Unix/Linux => ':', Windows => ';'. The trailing backslashes are Unix shell
    line continuations; on Windows, enter each command on a single line.)
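When the classpath is assembled from Java code, for example in a launcher, the separator difference can be avoided by using java.io.File.pathSeparator, which the JVM sets to ':' on Unix/Linux and ';' on Windows. The following is a minimal sketch; the class name ClasspathSketch and the method classpath are hypothetical.

```java
import java.io.File;

// Hypothetical helper, not shipped with the lemmatizer.
class ClasspathSketch {

    /** Joins classpath entries with the separator of the current platform. */
    static String classpath(String... entries) {
        return String.join(File.pathSeparator, entries);
    }

    public static void main(String[] args) {
        // Builds the classpath used in the commands above.
        System.out.println(classpath(".", "adhocLemmatizer.jar"));
    }
}
```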