LingTranSoft.info

Software

You are here

TECkit

SUMMARY:  (Text Encoding Conversion kit)  A toolkit for converting plain-text files from one encoding to another (e.g. Legacy to Unicode).

DETAILS:  TECkit is a low-level toolkit intended to be used by other applications that need to perform encoding conversions (e.g., when importing legacy data into a Unicode-based application). The primary component of the TECkit package is therefore a library that performs conversions; this is the "TECkit engine." The engine relies on mapping tables in a specific binary format (for which documentation is available); there is a compiler that creates such tables from a human-readable mapping description (a simple text file).

WorldPad

SUMMARY: WorldPad is a text editor that can display text in complex scripts using Graphite.  It is an SIL product and is distributed as part of FieldWorks.

DETAILS: WorldPad is a basic text editor whose main distinction is the ability to display text in complex scripts using Graphite, a programmable rendering engine particularly suited to complex minority scripts. WorldPad can also be used to work with text in simple "Roman" scripts. Some of WorldPad's text-editing features include multilingual text processing, named styles, right-to-left paragraph...

Phonology Template Editor and Search Tool (PTEST)

The Phonology Template Editor and Search Tool was developed to assist with the phonological analysis of a Bantu language. The program:

  • generates a phonology report;
  • carries out various types of searches of phonetic data;
  • serves as an interactive environment for writing and editing a phonology paper;
  • generates an electronic publishable report (HTML formatted) from a generated phonology report;
  • interfaces with Speech Analyzer;
  • allows creation of user-defined phonology reports;
     

Ktagger

A part-of-speech tagger based on PC-KIMMO.

For more information, use this contact.

 

Ktext

KTEXT reads a text from a disk file, parses each word using the PC-KIMMO parser, and writes the results to a new disk file. This new file is in the form of a structured text file where each word of the original text is represented as a database record composed of several fields. Each word record contains a field for the original word, a field for the underlying or lexical form of the word, and a field for the gloss string.

For more information, use this contact.

 

PC-KIMMO

PC-KIMMO is a new implementation for microcomputers of a program dubbed KIMMO after its inventor Kimmo Koskenniemi (see Koskenniemi 1983). It is of interest to computational linguists, descriptive linguists, and those developing natural language processing systems. The program is designed to generate (produce) and/or recognize (parse) words using a two-level model of word structure in which a word is represented as a correspondence between its lexical level form and its surface level form.

PC-PARSE

PC-PARSE is an archive which contains a set of programs for performing morphological or syntactic analysis. PC-PARSE includes code for

PC-PATR

A unification-based syntactic parser available individually, but more commonly distributed as part of PC-PARSE.

For more information, use this contact.

Unicode Ccount

UnicodeCCount is a quick-and-dirty Unicode-aware replacement for Ccount, the character count utility. Written in Perl, the program is available both as the Perl source (requires Perl 5.8.1 or newer) and as a stand-alone Windows EXE.

 

As this program is distributed at no cost, the SIL individual who created it and upgrades it is unable to provide a commercial level of personal technical support. He is interested in hearing from users, however, and will try to resolve problems that are reported to him. You can send feedback to the author here.

 

MacAutoFormat

MacAutoFormat is a text formatting program which converts Standard Format text files to RTF format. These resulting files then automatically format when opened in a program with an RTF converter such as Microsoft Word.

For more information, use this contact.