The Teneo Platform currently offers Natural Language Processing (NLP) Capabilities and supports the development of conversational AI solutions in 86 languages. This means that the Platform works with these languages, understands their alphabets and characters, understands what a word and a sentence are in each of those languages, and is able to normalize and pre-process text (e.g. tolerate misspellings and abbreviations).
More specifically, it also means that the Teneo Platform supports creating Flows, Language Objects, Entities, and Classes; using the Teneo Linguistic Modelling Language for condition writing; creating Integrations and other global elements; and using revisitability, flow stacks, dialogue history, and so on.
In the Teneo Platform, the NLP Capabilities are provided by Input Processors (IPs), which pre-process inputs so that the Teneo Engine can work with them. This pre-processing covers tasks such as normalization, tokenization, and spelling correction, as well as number recognition and language detection.
Each language supported by the Teneo Platform has a chain of Input Processors that know how to process that particular language. Most of the available languages share the Standard Input Processors Chain with different configuration schemes, while separate Input Processors Chains are available for Chinese, Japanese, Korean, and Turkish.
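The idea of a chain of processors can be illustrated with a minimal Python sketch. This is not the Teneo API: every name below (`normalize`, `tokenize`, `make_spelling_corrector`, `run_chain`, and the tiny corrections table) is a hypothetical stand-in, and the real Input Processors are configured inside the Teneo Platform rather than written by hand. The sketch only shows the general pattern of passing an input through an ordered list of steps, each of which enriches the result of the previous one.

```python
import re
from typing import Any, Callable, Dict, List

# Hypothetical types: the working state for one user input, and a processor
# that takes the state and returns an updated copy.
State = Dict[str, Any]
Processor = Callable[[State], State]


def normalize(state: State) -> State:
    """Lowercase the text and collapse runs of whitespace."""
    text = re.sub(r"\s+", " ", state["text"].lower().strip())
    return {**state, "text": text}


def tokenize(state: State) -> State:
    """Split the text into word tokens and single punctuation tokens."""
    tokens = re.findall(r"\w+|[^\w\s]", state["text"])
    return {**state, "tokens": tokens}


def make_spelling_corrector(corrections: Dict[str, str]) -> Processor:
    """Build a processor that replaces known misspellings and abbreviations."""
    def correct(state: State) -> State:
        fixed = [corrections.get(token, token) for token in state["tokens"]]
        return {**state, "tokens": fixed}
    return correct


def run_chain(chain: List[Processor], user_input: str) -> State:
    """Run the input through each processor in order."""
    state: State = {"text": user_input}
    for processor in chain:
        state = processor(state)
    return state


# An illustrative chain; the correction table is made up for this example.
example_chain = [
    normalize,
    tokenize,
    make_spelling_corrector({"helo": "hello", "pls": "please"}),
]

result = run_chain(example_chain, "  Helo,   can you help me pls? ")
print(result["tokens"])
# ['hello', ',', 'can', 'you', 'help', 'me', 'please', '?']
```

In this picture, a language with its own chain (for example one that needs a different way of finding word boundaries) would simply use a different list of processors, while languages sharing a chain would differ only in the configuration passed to each step.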
On top of the NLP Capabilities, further Natural Language (NL) Analyzers are available for a number of the languages. The NL Analyzers group together additional resources available in the Teneo Platform: these resources provide additional information about evaluated inputs, but they are not required for a language to function in the Teneo Platform. The Teneo NL Analyzers include, among others, support for understanding and interpreting date and time expressions, Part-of-Speech (POS) and morphological annotation, and recognition of Named Entities.
Finally, the Teneo Platform also offers Pre-built Knowledge in the form of the Teneo NLU Ontology and Semantic Networks, together with Conversational Modules, in a number of the supported languages. Pre-built Knowledge is considered relevant for any project domain and any type of solution: it helps developers get a new solution up and running more quickly, because the project team can focus on project- or domain-specific needs only.
The table below provides an overview of the supported languages in the Teneo Platform.
Language | NLP Capabilities | NL Analyzers | Pre-built Knowledge |
---|---|---|---|
Afrikaans | ✔ | ||
Albanian | ✔ | ||
Amharic | ✔ | ||
Armenian | ✔ | ||
Azerbaijani | ✔ | ||
Basque | ✔ | ||
Belarusian | ✔ | ||
Bengali (Bangla) | ✔ | ||
Bosnian | ✔ | ||
Bulgarian | ✔ | | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Catalan | ✔ | ||
Chinese (Mandarin) | ✔ | POS and Morphology ✔ Date&Time ✔ Sentiment ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Croatian | ✔ | ||
Czech | ✔ | | Ontology and Semantic Network ✔ |
Danish | ✔ | POS and Morphology ✔ Date&Time ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Dutch | ✔ | POS and Morphology ✔ Date&Time ✔ Sentiment ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
English | ✔ | POS and Morphology ✔ NERs ✔ Date&Time ✔ Sentiment ✔ Offensive ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Esperanto | ✔ | ||
Estonian | ✔ | ||
Ewe | ✔ | ||
Faroese | ✔ | ||
Finnish | ✔ | ||
French | ✔ | POS and Morphology ✔ NERs ✔ Date&Time ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Frisian | ✔ | ||
Galician | ✔ | ||
Georgian | ✔ | ||
German | ✔ | POS and Morphology ✔ NERs ✔ Date&Time ✔ Sentiment ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Greek | ✔ | ||
Gujarati | ✔ | ||
Hindi | ✔ | ||
Hungarian | ✔ | ||
Icelandic | ✔ | ||
Igbo | ✔ | ||
Indonesian | ✔ | | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Irish | ✔ | ||
Italian | ✔ | POS and Morphology ✔ NERs ✔ Date&Time ✔ Sentiment ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Japanese | ✔ | POS and Morphology ✔ NERs ✔ Date&Time ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Kazakh | ✔ | ||
Kinyarwanda | ✔ | ||
Kirundi (Rundi) | ✔ | ||
Korean | ✔ | POS and Morphology ✔ | |
Kyrgyz | ✔ | ||
Latvian | ✔ | ||
Lithuanian | ✔ | ||
Luxembourgish | ✔ | ||
Macedonian | ✔ | ||
Malagasy | ✔ | ||
Malay | ✔ | | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Maltese | ✔ | ||
Marathi | ✔ | ||
Mongolian | ✔ | ||
Nepali | ✔ | ||
Norwegian | ✔ | Date&Time ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Odia | ✔ | ||
Oromo | ✔ | ||
Papiamento | ✔ | ||
Polish | ✔ | ||
Portuguese (European/Brazilian) | ✔ | Date&Time ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Quechuan (Quechua) | ✔ | ||
Romanian | ✔ | ||
Romansh | ✔ | ||
Russian | ✔ | | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Sámi | ✔ | ||
Sango | ✔ | ||
Scottish Gaelic | ✔ | ||
Serbian | ✔ | ||
Shona | ✔ | ||
Sinhala | ✔ | ||
Slovak | ✔ | ||
Slovene | ✔ | ||
Somali | ✔ | ||
Spanish | ✔ | POS and Morphology ✔ NERs ✔ Date&Time ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Swahili (Kiswahili) | ✔ | ||
Swazi | ✔ | ||
Swedish | ✔ | POS and Morphology ✔ NERs ✔ Date&Time ✔ Sentiment ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Tigrinya | ✔ | ||
Tsonga | ✔ | ||
Tswana (Setswana) | ✔ | ||
Turkish | ✔ | POS and Morphology ✔ NERs ✔ | Ontology and Semantic Network ✔ Conversational Modules ✔ |
Turkmen | ✔ | ||
Ukrainian | ✔ | ||
Uzbek | ✔ | ||
Vietnamese | ✔ | ||
Welsh | ✔ | ||
Yoruba | ✔ | ||
Zulu (isiZulu) | ✔ | ||
The Teneo Platform offers Pre-built Knowledge in the form of the Teneo NLU Ontology and Semantic Networks, together with Conversational Modules, in a number of the supported languages. The Pre-built Knowledge is considered relevant for any project domain and any type of solution: it helps developers get a new solution up and running more quickly, because the project team can focus on project- or domain-specific needs only.
The Teneo NLU Ontology and Semantic Networks are Lexical Resources containing off-the-shelf building blocks for building Natural Language Interaction solutions in Teneo Studio. They are structured and designed to facilitate the understanding of a specific language. For some of the languages, the Lexical Resources also contain Additional Resources in the form of objects which support the functionalities of the NL Analyzers. These may be, for example, Morphological ANNOT Language Objects, Named Entity Objects, or objects that support the understanding and interpretation of date and time expressions.
The Conversational Modules are Dialogue Resources, or template solutions, which provide more than simple smalltalk or chitchat Flows. They provide the generic dialogue capabilities that a conversational AI application needs in order to behave in a human-like way, show social skills and personalized behavior, and simulate intelligence, so that it engages end users and earns their trust in its capabilities.
The Teneo NLU Ontology and Semantic Networks are distributed as Lexical Resources which can be assigned to solutions in Teneo Studio, while the Conversational Modules are distributed as template solutions available upon creation of new solutions in Studio.
The table below lists the languages currently available with Pre-built Knowledge and, where applicable, any Additional Resources available (providing support to the NL Analyzers).
Language | Teneo NLU Ontology and Semantic Network | Conversational Module | Additional Resources (included as objects in the Lexical Resource) |
---|---|---|---|
Bulgarian | Lexical Resource ✔ | Dialogue Resource ✔ | |
Chinese (Mandarin) | Lexical Resource ✔ | Dialogue Resource ✔ | Morphology ANNOT objects ✔ Date&Time Objects ✔ Sentiment Analysis Objects ✔ |
Czech | Lexical Resource ✔ | ||
Danish | Lexical Resource ✔ | Dialogue Resource ✔ | Morphology ANNOT objects ✔ Date&Time Objects ✔ |
Dutch | Lexical Resource ✔ | Dialogue Resource ✔ | Morphology ANNOT objects ✔ Date&Time Objects ✔ Sentiment Analysis Objects ✔ |
English | Lexical Resource ✔ | Dialogue Resource ✔ | Morphology ANNOT objects ✔ Named Entity ANNOT objects ✔ Date&Time Objects ✔ Sentiment Analysis Objects ✔ Offensive Language Detector Objects ✔ |
French | Lexical Resource ✔ | Dialogue Resource ✔ | Morphology ANNOT objects ✔ Named Entity ANNOT objects ✔ Date&Time Objects ✔ |
German | Lexical Resource ✔ | Dialogue Resource ✔ | Morphology ANNOT objects ✔ Date&Time Objects ✔ Sentiment Analysis Objects ✔ |
Indonesian | Lexical Resource ✔ | Dialogue Resource ✔ | |
Italian | Lexical Resource ✔ | Dialogue Resource ✔ | Morphology ANNOT objects ✔ Named Entity ANNOT objects ✔ Date&Time Objects ✔ Sentiment Analysis Objects ✔ |
Japanese | Lexical Resource ✔ | Dialogue Resource ✔ | Morphology ANNOT objects ✔ Named Entity ANNOT objects ✔ Date&Time Objects ✔ |
Malay | Lexical Resource ✔ | Dialogue Resource ✔ | |
Norwegian | Lexical Resource ✔ | Dialogue Resource ✔ | Date&Time Objects ✔ |
Portuguese (European and Brazilian) | Lexical Resource ✔ | Dialogue Resource ✔ | Date&Time Objects ✔ |
Russian | Lexical Resource ✔ | Dialogue Resource ✔ | |
Spanish | Lexical Resource ✔ | Dialogue Resource ✔ | Morphology ANNOT objects ✔ Named Entity ANNOT objects ✔ Date&Time Objects ✔ |
Swedish | Lexical Resource ✔ | Dialogue Resource ✔ | Morphology ANNOT objects ✔ Named Entity ANNOT objects ✔ Date&Time Objects ✔ Sentiment Analysis Objects ✔ |
Turkish | Lexical Resource ✔ | Dialogue Resource ✔ | Morphology ANNOT objects ✔ Named Entity ANNOT objects ✔ |