Language Support#
The search feature of an enterprise GenAI platform is crucial for efficient information retrieval. Language support is essential to support diverse user bases, enabling users to search and retrieve information in their preferred language and enhancing the overall user experience.
The platform supports multiple languages by default, as outlined in the following table. However, out of the box, it is optimized for single-language projects in English. Squirro recommends removing the extractive-qa
and cross-encoder
scoring profiles from the default configuration for multi-lingual projects to improve efficiency. For detailed information, to learn more about scoring profiles and how to configure them, see the How to Use Scoring Profiles to Customize Document Relevancy Scoring page.
Language |
ISO code |
Keyword Search |
Semantic Search |
---|---|---|---|
Afrikaans |
af |
||
Albanian |
sq |
||
Amharic |
am |
||
Arabic |
ar |
||
Armenian |
hy |
||
Assamese |
as |
||
Azerbaijani |
az |
||
Basque |
eu |
||
Belarusian |
be |
||
Bengali |
bn |
||
Bengali Romanize |
n.a. |
||
Bosnian |
bs |
||
Breton |
br |
||
Bulgarian |
bg |
||
Burmese |
my |
||
Burmese zawgyi font |
n.a. |
||
Catalan |
ca |
||
Chinese |
zh |
||
Chinese (Simplified) |
n.a. |
||
Chinese (Traditional) |
n.a. |
||
Croatian |
hr |
||
Czech |
cs |
||
Danish |
da |
||
Dutch |
nl |
||
English |
en |
||
Esperanto |
eo |
||
Estonian |
et |
||
Filipino |
n.a. |
||
Finnish |
fi |
||
French |
fr |
||
Galician |
gl |
||
Georgian |
ka |
||
German |
de |
||
Greek |
el |
||
Gujarati |
gu |
||
Hausa |
ha |
||
Hebrew |
he |
||
Hindi |
hi |
||
Hindi Romanize |
n.a. |
||
Hungarian |
hu |
||
Icelandic |
is |
||
Indonesian |
id |
||
Irish |
ga |
||
Italian |
it |
||
Japanese |
ja |
||
Javanese |
jv |
||
Kannada |
kn |
||
Kazakh |
kk |
||
Khmer |
n.a. |
||
Korean |
ko |
||
Kurdish (Kurmanji) |
ku |
||
Kyrgyz |
n.a. |
||
Lao |
lo |
||
Latin |
la |
||
Latvian |
lv |
||
Lithuanian |
lt |
||
Macedonian |
mk |
||
Malagasy |
mg |
||
Malay |
ms |
||
Malayalam |
ml |
||
Marathi |
mr |
||
Mongolian |
mn |
||
Nepali |
ne |
||
Norwegian |
no |
||
Oriya |
or |
||
Oromo |
om |
||
Pashto |
ps |
||
Persian |
fa |
||
Polish |
pl |
||
Portuguese |
pt |
||
Punjabi |
pa |
||
Romanian |
ro |
||
Russian |
ru |
||
Sanskrit |
sa |
||
Scottish Gaelic |
gd |
||
Serbian |
sr |
||
Sindhi |
sd |
||
Sinhala |
si |
||
Slovak |
sk |
||
Slovenian |
sl |
||
Somali |
so |
||
Spanish |
es |
||
Sundanese |
su |
||
Swahili |
sw |
||
Swedish |
sv |
||
Tamil |
ta |
||
Tamil Romanize |
n.a. |
||
Telugu |
te |
||
Telugu Romanize |
n.a. |
||
Thai |
th |
||
Turkish |
tr |
||
Ukrainian |
uk |
||
Urdu |
ur |
||
Urdu Romanize |
n.a. |
||
Uyghur |
n.a. |
||
Uzbek |
uz |
||
Vietnamese |
vi |
||
Welsh |
cy |
||
Western Frisian |
n.a. |
||
Xhosa |
xh |
||
Yiddish |
yi |