You are here
PC-KIMMO is a new implementation for microcomputers of a program dubbed KIMMO after its inventor Kimmo Koskenniemi (see Koskenniemi 1983). It is of interest to computational linguists, descriptive linguists, and those developing natural language processing systems. The program is designed to generate (produce) and/or recognize (parse) words using a two-level model of word structure in which a word is represented as a correspondence between its lexical level form and its surface level form.
KTEXT reads a text from a disk file, parses each word using the PC-KIMMO parser, and writes the results to a new disk file. This new file is in the form of a structured text file where each word of the original text is represented as a database record composed of several fields. Each word record contains a field for the original word, a field for the underlying or lexical form of the word, and a field for the gloss string.
For more information, use this contact.
The Phonology Template Editor and Search Tool was developed to assist with the phonological analysis of a Bantu language. The program:
- generates a phonology report;
- carries out various types of searches of phonetic data;
- serves as an interactive environment for writing and editing a phonology paper;
- generates an electronic publishable report (HTML formatted) from a generated phonology report;
- interfaces with Speech Analyzer;
- allows creation of user-defined phonology reports;
SUMMARY: WorldPad is a text editor that can display text in complex scripts using Graphite. It is an SIL product and is distributed as part of FieldWorks.
DETAILS: WorldPad is a basic text editor whose main distinction is the ability to display text in complex scripts using Graphite, a programmable rendering engine particularly suited to complex minority scripts. WorldPad can also be used to work with text in simple "Roman" scripts. Some of WorldPad's text-editing features include multilingual text processing, named styles, right-to-left paragraph...
SUMMARY: (Text Encoding Conversion kit) A toolkit for converting plain-text files from one encoding to another (e.g. Legacy to Unicode).
DETAILS: TECkit is a low-level toolkit intended to be used by other applications that need to perform encoding conversions (e.g., when importing legacy data into a Unicode-based application). The primary component of the TECkit package is therefore a library that performs conversions; this is the "TECkit engine." The engine relies on mapping tables in a specific binary format (for which documentation is available); there is a compiler that creates such tables from a human-readable mapping description (a simple text file).
Bibledit is a multi-platform Bible editor, providing tools to the Bible translator.
- Native USFM text editor, several texts can be opened at once
- Project notes editor
- Chapter and book outline view
- Resizeable and moveable windows
- Easy application of styles
- Styles editor
- Selection and display of project notes by various parameters
- Online help
- Backup and restore
- Import and export
- Resources display
- Spelling check ...
A basic need to do any language work on the computer is a font that will display the needed characters. Fonts that come with your operating system may have a certain number of characters already, but the SIL fonts frequently have a wider variety of characters.
The Latin/Cyrillic fonts differ mostly in the typeface. Charis, Doulos and Gentium are serif fonts. Andika is a sans-serif typeface designed for new readers with easily distinguished letter forms.
This tool makes it much easier to see the results of syntactic parses produced by the PC-PATR program (https://software.sil.org/pc-patr/). For each sentence, it shows the interlinear result and the syntactic tree. When one clicks on a node in the tree, it also shows the features at that node and the rule in the PC-PATR grammar file.
It is a .Net version 2.0 program so it requires a Windows operating system that has .Net version 2.0 or higher installed on it.