Using Non-Latin Script Languages in Search Queries

When building search queries in MT Connect, it’s important to understand how different languages are processed, especially those that use non-Latin scripts. Some of these languages, such as Chinese, Japanese, and Thai, do not use spaces to separate words, which affects how the system interprets search terms.

For best results, enclose words or phrases in double quotes (e.g., “世界”) to ensure they’re treated as a complete unit.

Below is a list of commonly supported non-Latin script languages and the script each one uses:

Language                          Script Used
Chinese (Simplified/Traditional)  Han (CJK)
Japanese                          Kanji + Kana
Korean                            Hangul
Arabic                            Arabic
Hebrew                            Hebrew
Russian                           Cyrillic
Greek                             Greek
Hindi                             Devanagari
Thai                              Thai
Tamil                             Tamil
Bengali                           Bengali
Georgian                          Georgian
Armenian                          Armenian
Persian (Farsi)                   Arabic
Urdu                              Arabic
Vietnamese                        Latin + tone marks
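
If keywords are prepared programmatically before being pasted into a query, it can help to check which of the scripts listed above a keyword contains. The following is a minimal sketch, not part of MT Connect: the function name `scripts_in` and the `SCRIPT_PREFIXES` mapping are hypothetical, and the check relies only on the first word of each character's Unicode name, as reported by Python's standard `unicodedata` module. Kanji are reported as Han (CJK) because they share the same Unicode range.

```python
import unicodedata

# Hypothetical mapping from the first word of a Unicode character name
# to the script labels used in the list above.
SCRIPT_PREFIXES = {
    "CJK": "Han (CJK)",
    "HIRAGANA": "Kana",
    "KATAKANA": "Kana",
    "HANGUL": "Hangul",
    "ARABIC": "Arabic",
    "HEBREW": "Hebrew",
    "CYRILLIC": "Cyrillic",
    "GREEK": "Greek",
    "DEVANAGARI": "Devanagari",
    "THAI": "Thai",
    "TAMIL": "Tamil",
    "BENGALI": "Bengali",
    "GEORGIAN": "Georgian",
    "ARMENIAN": "Armenian",
}


def scripts_in(text: str) -> set[str]:
    """Return the set of listed scripts that appear in the text."""
    found = set()
    for ch in text:
        # unicodedata.name returns the default "" for unnamed characters.
        name = unicodedata.name(ch, "")
        prefix = name.split(" ")[0] if name else ""
        if prefix in SCRIPT_PREFIXES:
            found.add(SCRIPT_PREFIXES[prefix])
    return found
```

An empty result means the keyword contains none of the listed non-Latin scripts. Vietnamese is not detected by this sketch because its letters are Latin characters with diacritics.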

Important Tips

  • Always enclose non-Latin keywords in double quotes:
    Example: “こんにちは” for Japanese or “שלום” for Hebrew
  • For Vietnamese, although it uses Latin script, tone marks affect search behavior, so double quotes are also recommended.

NOTE: If double quotes are not used, the system may treat individual characters as separate search terms, leading to inaccurate results.
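
The quoting rule above can be applied automatically when keywords come from a list or spreadsheet. The following is a minimal sketch under stated assumptions: the helper names `quote_if_needed` and `build_query` are hypothetical, terms are joined with a single space, and straight double quotes (") are used. Check the "Creating effective search queries" article for the operators and quote characters MT Connect actually accepts.

```python
def quote_if_needed(term: str) -> str:
    """Wrap a keyword or phrase in double quotes if it contains non-ASCII characters.

    This covers non-Latin scripts as well as Latin letters with tone marks,
    such as Vietnamese. Terms that are already quoted are left unchanged.
    """
    term = term.strip()
    already_quoted = len(term) >= 2 and term[0] == '"' and term[-1] == '"'
    has_non_ascii = any(ord(ch) > 127 for ch in term)
    if has_non_ascii and not already_quoted:
        return f'"{term}"'
    return term


def build_query(terms: list[str]) -> str:
    """Join keywords into one query string (space-separated by assumption)."""
    return " ".join(quote_if_needed(t) for t in terms)
```

For example, `build_query(["report", "世界", "xin chào"])` returns `report "世界" "xin chào"`: the plain ASCII keyword is left alone, while the Chinese keyword and the Vietnamese phrase are each quoted as a single unit.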

For more help with multilingual queries, see our article “Creating effective search queries”, or contact [email protected] using “Technical” as the subject.