Our programmers are experienced specialists in natural language processing. We develop tools for facilitating human-computer interaction and allowing computers to process textual data. Here are some examples of what we do:
Text normalization
Text normalization is the automated conversion of written text (featuring numbers, acronyms, symbols, abbreviations, etc.) into a spoken-like form. For example:
$200 -> two hundred dollars
Such normalization is the essential first step towards software that lets computers correctly read out a text containing numbers or abbreviations. The following complicated phrase could serve as an example for English:
to Mr. John Brown, PhD, OBE, holding passport No. BAC 1234567, residing at 123/2 Princes Str. in Edinburgh, EH6-8NR, UK
How should this be read? This would be one way:
to doctor john brown, officer of the most excellent order of the british empire, holding passport number B A C one two three four five six seven, residing at one two three slash two princes street in edinburgh, e h six, eight n r, united kingdom.
A human reader does this almost without thinking, but it is not as easy for a computer.
Our experience shows that normalization of this kind resembles machine translation. Here, the source language is the written word and the target language is speech. After the necessary research, we built a normalization tool as a module of Translatica.
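As a rough illustration of the $200 example above, here is a minimal Python sketch that spells out dollar amounts and expands a few abbreviations. It is not the Translatica module; the names number_to_words, normalize and ABBREVIATIONS are hypothetical, and the sketch only handles whole amounts below one million.

```python
import re

ONES = ["zero", "one", "two", "three", "four", "five", "six", "seven",
        "eight", "nine", "ten", "eleven", "twelve", "thirteen", "fourteen",
        "fifteen", "sixteen", "seventeen", "eighteen", "nineteen"]
TENS = ["", "", "twenty", "thirty", "forty", "fifty",
        "sixty", "seventy", "eighty", "ninety"]

# Hypothetical, deliberately tiny abbreviation table (an assumption).
ABBREVIATIONS = {"No.": "number", "Str.": "street", "UK": "united kingdom"}


def number_to_words(n: int) -> str:
    """Spell out an integer in the range 0..999999."""
    if not 0 <= n < 1_000_000:
        raise ValueError("only 0..999999 is supported in this sketch")
    if n < 20:
        return ONES[n]
    if n < 100:
        tens, rest = divmod(n, 10)
        return TENS[tens] + ("-" + ONES[rest] if rest else "")
    if n < 1000:
        hundreds, rest = divmod(n, 100)
        tail = " " + number_to_words(rest) if rest else ""
        return ONES[hundreds] + " hundred" + tail
    thousands, rest = divmod(n, 1000)
    tail = " " + number_to_words(rest) if rest else ""
    return number_to_words(thousands) + " thousand" + tail


def normalize(text: str) -> str:
    """Rewrite dollar amounts and known abbreviations in spoken-like form."""
    def dollars(match: re.Match) -> str:
        amount = int(match.group(1))
        unit = "dollar" if amount == 1 else "dollars"
        return f"{number_to_words(amount)} {unit}"

    # Only amounts of up to six digits are rewritten; longer ones are left alone.
    text = re.sub(r"\$(\d{1,6})(?!\d)", dollars, text)
    for abbreviation, expansion in ABBREVIATIONS.items():
        pattern = r"(?<!\w)" + re.escape(abbreviation) + r"(?!\w)"
        text = re.sub(pattern, expansion, text)
    return text


print(normalize("$200"))
print(normalize("passport No. BAC 1234567"))
```

A real normalizer also has to resolve ambiguity from context (for instance, whether "Str." means "street" or something else), which is why the task resembles translation rather than simple lookup.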
Blog-generating system (SEO-TOOL)
With a corpus of appropriate lexical data we can automatically generate blog entries. The generated texts are intelligible and coherent, not a random cluster of correctly inflected words. Our knowledge and tools allow us to generate blogs on any subject and with the desired keyword density. The system can help you quickly and successfully create resources for website positioning or for marketing purposes.
If this is what you need, feel free to contact us.
Paraphrasing system (SEO-TOOL)
A key part of website positioning is the development of content. We know how difficult it is to provide the right amount of text on a subject quickly and securely. This is why we developed a method for paraphrasing texts in terms of both vocabulary and syntax. From a source text, depending on the parameters and the desired level of difference, we can generate up to about twenty texts on the same topic that differ from the original and from each other.
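The vocabulary side of paraphrasing can be sketched, under heavy simplification, as synonym substitution. The Python example below is only an illustration and not our production method: the SYNONYMS table and the paraphrase_variants function are hypothetical, the input is assumed to be lowercase, and syntax is not changed at all.

```python
from itertools import product

# Hypothetical synonym table (an assumption); each entry lists the word itself first.
SYNONYMS = {
    "quick": ["quick", "fast", "rapid"],
    "help": ["help", "support"],
}


def paraphrase_variants(text: str, limit: int = 20) -> list[str]:
    """Return up to `limit` variants of `text` that differ from it in wording."""
    words = text.split()
    # For every word, collect its interchangeable alternatives (or just itself).
    options = [SYNONYMS.get(word, [word]) for word in words]
    variants = []
    for combination in product(*options):
        candidate = " ".join(combination)
        if candidate != text:
            variants.append(candidate)
        if len(variants) == limit:
            break
    return variants


for variant in paraphrase_variants("we offer quick help"):
    print(variant)
```

The limit parameter mirrors the idea of capping the number of generated texts; a realistic system would also need to check that each substitution fits its context.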
Swearing censorship
This group of tools lets you monitor forums, portals, public utility systems, etc. for offensive content. Our solutions can effectively replace human intervention. Depending on the version, the system checks words, expressions or contexts. We especially recommend the product for websites aimed at children.
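The simplest, word-level variant can be sketched in Python as follows. This is a minimal illustration only: BLOCKLIST holds harmless placeholder words, and the censor function is a hypothetical name. Expression-level and context-level checks are beyond this sketch.

```python
import re

# Placeholder entries (an assumption); a real deployment would load a curated list.
BLOCKLIST = {"darn", "heck"}

# Match whole words only, ignoring case, so "heckle" is not affected by "heck".
_PATTERN = re.compile(
    r"\b(" + "|".join(sorted(re.escape(word) for word in BLOCKLIST)) + r")\b",
    re.IGNORECASE,
)


def censor(text: str) -> str:
    """Replace each blocklisted word with asterisks of the same length."""
    return _PATTERN.sub(lambda match: "*" * len(match.group(0)), text)


print(censor("What the HECK, don't heckle me"))
```

Matching on word boundaries avoids masking innocent words that merely contain a blocked string, which is one of the most common failures of naive filters.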
Intelligent article summarising system
Quite often portal and blog homepages contain article or entry summaries that are generated automatically. The most common summarising algorithms leave much to be desired, as they are limited to the first sentence of a text, which is problematic if the first sentence begins with an abbreviation, such as Prof. Our summarising tools are intelligent: they are based on text normalization and syntactic analysis, which allow them to precisely define the portion of a text required for a summary.
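The abbreviation problem described above can be illustrated with a short Python sketch. It stands in for, and is much simpler than, the normalization and syntactic analysis our tools rely on: the ABBREVIATIONS set and the first_sentence function are hypothetical, and the sketch treats any other token ending in ".", "!" or "?" as the end of a sentence.

```python
# Hypothetical list of abbreviations that must not end a sentence (an assumption).
ABBREVIATIONS = {"prof.", "dr.", "mr.", "mrs.", "ms.", "no.", "str.", "etc."}


def naive_first_sentence(text: str) -> str:
    """Cut at the first full stop, as many simple summarisers do."""
    return text.split(". ")[0] + "."


def first_sentence(text: str) -> str:
    """Return the first sentence, skipping full stops that belong to abbreviations."""
    tokens = text.split()
    for index, token in enumerate(tokens):
        ends_sentence = token.endswith((".", "!", "?"))
        if ends_sentence and token.lower() not in ABBREVIATIONS:
            return " ".join(tokens[: index + 1])
    # No sentence boundary found: return the whole text.
    return " ".join(tokens)


sample = "Prof. Smith won the award. He thanked his team."
print(naive_first_sentence(sample))
print(first_sentence(sample))
```

On the sample text the naive version stops after "Prof.", while the abbreviation-aware version returns the complete first sentence.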