Someone apparently did observe ChatGPT (I think it was ChatGPT) switch to Chinese for some parts of its reasoning/calculations and then back to English for the final answer. That's somehow even weirder than the LLM giving different answers to the same input.
I've seen this happen as well with o3-mini, but I'm honestly not sure what triggered it. I use it all the time but have only had it switch to Chinese during reasoning maybe twice.
I get strange languages sprinkled through my Gemini responses, including some very obscure ones. It just randomly changes language for one or two words.
Is it possible the "vector" is more accurate in another language? Like esprit d'escalier or schadenfreude, or any number of other concepts that are a single word in one language but paragraphs or more in others?
I saw Claude 3.7 write a comment in my code in Russian, followed by the English text "Russian coding" (likely left over from a previous modification), for no reason.
> the LLM giving different answers to the same input.
LLMs are actually designed to have some randomness in their responses.
To make the answer reproducible, set the temperature to 0 (eliminating sampling randomness) and provide a static seed (ensuring consistent results) in the LLM's configuration.
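Roughly, with the OpenAI Python client it looks like this (the model name and seed are just example values, and note that OpenAI documents seed as best-effort determinism only):

    from openai import OpenAI

    client = OpenAI()  # reads OPENAI_API_KEY from the environment

    response = client.chat.completions.create(
        model="gpt-4o-mini",  # example model name
        messages=[{"role": "user", "content": "What is 2 + 2?"}],
        temperature=0,        # greedy decoding: no sampling randomness
        seed=42,              # arbitrary static seed for repeatable runs
    )
    print(response.choices[0].message.content)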
How much influence the (pseudo-)random number generator has is controlled by a parameter called "temperature" in most models.
Setting it to 0 in theory eliminates all randomness: instead of sampling one token from the list of predicted next tokens, the single most probable token is always chosen.
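For the curious, here is a toy sketch of what the temperature knob does to the next-token distribution (made-up logits; real engines do the same math over the full vocabulary):

    import numpy as np

    def sample_next_token(logits, temperature, rng):
        """Pick a next-token id from raw logits at a given temperature."""
        logits = np.asarray(logits, dtype=float)
        if temperature == 0:
            # Greedy decoding: always the single most probable token.
            return int(np.argmax(logits))
        # Dividing by temperature sharpens (<1) or flattens (>1) the
        # distribution before the softmax.
        scaled = logits / temperature
        probs = np.exp(scaled - scaled.max())  # subtract max for stability
        probs /= probs.sum()
        return int(rng.choice(len(probs), p=probs))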
However, in practice, setting the temperature to 0 in most GUIs does not actually set it to 0 but to a "very small" value ("epsilon"), to avoid a division-by-zero exception/crash in the sampling formula.
So don't be surprised if you cannot get rid of random behavior entirely.
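Internally that clamp can be as trivial as this (the epsilon value here is made up; every implementation picks its own):

    EPSILON = 1e-6  # illustrative; real engines choose their own value

    def effective_temperature(requested):
        # Clamp 0 up to a tiny positive value so that logits / temperature
        # never divides by zero. The result is "almost greedy", but float
        # rounding and near-ties can still occasionally flip a token.
        return max(requested, EPSILON)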
In most inference engines I've seen it's not even necessary to set the temperature to 0: the sampling randomness all comes from the seeded RNG, so a static seed makes the output reproducible at any temperature.
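Toy demonstration of that point, with a made-up four-token "vocabulary": fix the seed and even sampling at temperature 0.8 replays exactly:

    import numpy as np

    def generate(seed, temperature=0.8, steps=5):
        rng = np.random.default_rng(seed)  # all randomness flows from here
        logits = np.array([2.0, 1.0, 0.5, 0.1])  # pretend 4-token vocabulary
        scaled = logits / temperature
        probs = np.exp(scaled - scaled.max())
        probs /= probs.sum()
        return [int(rng.choice(len(probs), p=probs)) for _ in range(steps)]

    assert generate(seed=123) == generate(seed=123)  # same seed, same tokens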