Tag: Business vocabularies

Business vocabularies, also called lexicons, are a component of NLP processing. They are linguistic sets with grammatical and semantic information about words in a given language. One could think of lists of people, organizations, locations or products.

With Kairntech Studio you can import business vocabularies to create a text annotator (the official term is gazetteer).

Vocabularies can then be used to automatically annotate documents and help you jump-start the creation of an annotated dataset in order to train a Machine Learning model. Few-shot learning with Large Language Models may also be used in this context.

Vocabularies can be used either to annotate or to avoid annotations. The consolidation of different annotations and normalization processes are part of the AI Pipelines within the Kairntech solution.

For more detailed information:

How to import a vocabulary? – Kairntech Documentation

How to build a gazetteer? – Kairntech Documentation