Choosing the database
The first interface screen allows to choose among available databases and to tune the interface parameters.
The main choice field lets you specify the name of a particular database.
If you have a right to modify databases on-line, you may use the "Login" button. You will need to have a registered user name and password to do that. If you do not have those requisites, you will still be able to view all the databases in read-only mode.
Several checkboxes below allow to switch viewing parameters.
The "Recoding" checkbox regulates the method of displaying special characters. Lexical data is stored within the database in phonetic transcription, utilising a special encoding system and special fonts (Windows fonts may be downloaded here; Macintosh fonts with the coding table Western MacRoman - HERE). If you do not use these fonts or standard Unicode fonts (on Unicode see below), the program may render special symbols by means of standard ASCII characters using a special encoding system. In this case you must check the "Recoding" checkbox.
The "Graphics as gifs" checkbox regulates the way of displaying doublebyte (Chinese, Greek, Arabic etc.) characters. By default characters are displayed as pictures in the GIF format. However, if the needed fonts are installed on your computer you may cancel the generation of GIFs. Remember: for Chinese you will need fonts in the BIG5 coding system; for Greek or Arabic you should install the fonts contained in TIMESTR.ZIP.
The "Use tables" checkbox allows you to turn off tables in the program output: it may be useful if you prefer a text browser (like Lynx).
The "Record numbers" checkbox allows you to watch the actual record numbers in the database or turn them off (which is the default option).
The "Multiple windows" checkbox lets you view links in separate windows or in the main browser window, whichever is more convenient for you.
After setting the above parameters press the "Submit query" button.
The small radiobuttons below allow you to choose one of the three generally
accepted encoding systems for Cyrillic (KOI, ALT and WIN) and to switch
between the English or Russian interface languages.
A recent addition (March 2001) is the possibility of using Unicode
encoding. Choose these options (UTF-RUS or UTF-ENG) if your browser has
Unicode capabilities and if a Unicode font is installed on your computer.
Note: So far only two browsers: Netscape 4 (and higher) and Internet
Explorer 4 (and higher) support Unicode. Unicode fonts also behave differently.
Times New Roman Star is a Unicode font and will display most symbols properly,
but has Cyrillic letters instead of some of the Unicode IPA symbols. Chinese
fonts like UWCXMF or Mingliu are OK for Chinese Unicode characters, but
lack most of the IPA symbols. Code2000, Lucida Sans Unicode and Titus Cyberbit,
on the other hand, do not include most of the CJK (Chinese, Japanese and
Korean) symbols. So far the most comprehensive font available is Arial
Unicode MS.
Press the "Change" button to enable the chosen encoding.
Querying a database
The main part of this screen is a table. By filling in its slots you specify which database fields you would like to include into your query and their contents. Every slot of the table corresponds to a field in the database.
By setting the flag "Include into report" close to a database field name you specify that this field should be included into the query result. If no field is specified all the database fields will be automatically included. The "Value" slots let you specify your requirements for the field value. The requirements are input as strings and the way the strings are interpreted by the query engine is regulated by the "Query method" slots. The following query methods are supported.
Match beginning
Match substring
Like beginning
Cyrillic letters may be entered as Roman surrounded by square brackets: [golova]
Greek letters also may be entered as Roman surrounded by figure braces, but the opening brace should be followed by the letter G: {Gkata}
Wildcard characters are allowed (but so far not within the Chinese text): the ? sign replaces any single character; the @ sign - any sequence of characters within a word.
Entering queries without recoding is at your own risk: the result will depend on the current code page of your browser, on your country settings etc.
If you use the UTF (Unicode) encoding, the wildcards and bracketed symbols like {k.} will be correctly processed only if the whole sequence starts with figure brackets.
A. Without Unicode
1) TwinBridge (or a similar system) should be installed on your computer. The coding system used in the databases is Big5, so make sure you choose the Big5 (not GB or anything else) coding in your program.
2) The appropriate font should be set in your browser's preferences.
If this is done, you should click on the menu option (in the lower part of the screen): Graphics: As fonts. You shall be able to enter queries containing characters using your current keyboard layout.
If your operating system and browser are Unicode-compatible and you have an IME (Input method) system installed, you may enter Chinese characters and other Unicode symbols in your query (but wildcard search is so far disabled).The databases presented here had been developed for several years by the Department of Comparative Linguistics and Ancient Languages of the Russian State University of the Humanities. The Chinese, Sino-Tibetan, North Caucasian and Yenisseian databases were compiled by S. A. Starostin; the Altaic database - by S. A. Starostin, O. A. Mudrak and A. V. Dybo; the Dravidian database - by G. S. Starostin; the Chukchee-Kamchatkan database - by O. A. Mudrak.
The computer database and interface systems were developed by S. A. Starostin and Yu. Bronnikov on the basis of S. A. Starostin's STARLING system, using the Clipper, C/C++ and TCL computer languages.