Language Support

Language Support#

The search feature of an enterprise GenAI platform is crucial for efficient information retrieval. Language support is essential to support diverse user bases, enabling users to search and retrieve information in their preferred language and enhancing the overall user experience.

The platform supports multiple languages by default, as outlined in the following table. However, out of the box, it is optimized for single-language projects in English. Squirro recommends removing the extractive-qa and cross-encoder scoring profiles from the default configuration for multi-lingual projects to improve efficiency. For detailed information, to learn more about scoring profiles and how to configure them, see the How to Use Scoring Profiles to Customize Document Relevancy Scoring page.

Language

ISO code

Keyword Search

Semantic Search

Afrikaans

af

Albanian

sq

Amharic

am

Arabic

ar

Armenian

hy

Assamese

as

Azerbaijani

az

Basque

eu

Belarusian

be

Bengali

bn

Bengali Romanize

n.a.

Bosnian

bs

Breton

br

Bulgarian

bg

Burmese

my

Burmese zawgyi font

n.a.

Catalan

ca

Chinese

zh

Chinese (Simplified)

n.a.

Chinese (Traditional)

n.a.

Croatian

hr

Czech

cs

Danish

da

Dutch

nl

English

en

Esperanto

eo

Estonian

et

Filipino

n.a.

Finnish

fi

French

fr

Galician

gl

Georgian

ka

German

de

Greek

el

Gujarati

gu

Hausa

ha

Hebrew

he

Hindi

hi

Hindi Romanize

n.a.

Hungarian

hu

Icelandic

is

Indonesian

id

Irish

ga

Italian

it

Japanese

ja

Javanese

jv

Kannada

kn

Kazakh

kk

Khmer

n.a.

Korean

ko

Kurdish (Kurmanji)

ku

Kyrgyz

n.a.

Lao

lo

Latin

la

Latvian

lv

Lithuanian

lt

Macedonian

mk

Malagasy

mg

Malay

ms

Malayalam

ml

Marathi

mr

Mongolian

mn

Nepali

ne

Norwegian

no

Oriya

or

Oromo

om

Pashto

ps

Persian

fa

Polish

pl

Portuguese

pt

Punjabi

pa

Romanian

ro

Russian

ru

Sanskrit

sa

Scottish Gaelic

gd

Serbian

sr

Sindhi

sd

Sinhala

si

Slovak

sk

Slovenian

sl

Somali

so

Spanish

es

Sundanese

su

Swahili

sw

Swedish

sv

Tamil

ta

Tamil Romanize

n.a.

Telugu

te

Telugu Romanize

n.a.

Thai

th

Turkish

tr

Ukrainian

uk

Urdu

ur

Urdu Romanize

n.a.

Uyghur

n.a.

Uzbek

uz

Vietnamese

vi

Welsh

cy

Western Frisian

n.a.

Xhosa

xh

Yiddish

yi