3673 Idiomatic Expressions (mostly NOT Chengyu)

Bosimao

Member
What this is
These are are all idioms or proverbs in a broad sense, but not all would be recognised as chengyu in standard dictionaries. On Pleco, the definitions for most of them should exist and you may often see them having the tags "IDIOM", "WELL-KNOWN PHRASE", “COMMON PHRASE", "COLLOQUIAL", "FIGURATIVE COLLOQUIAL" etc.

If any of them are missing a Pleco dictionary entry, check the internet to see if there’s some mention of what it means somewhere and report it as a missing entry. Otherwise, don’t report it and just let me know here on the forums. We don’t want to give the Pleco team a hard time :). Thank you for your understanding.

Why you should study this over Chengyu
TLDR; used more IRL.

From my experience learning and using Chinese in real-life settings, I’ve found that my idiom list is far more useful than studying standard chengyu. The expressions in my file come up constantly in casual conversations, on TV, in online comments, even in arguments. They match the tone and rhythm of how people actually speak. I’ve seen people use them to joke, to complain, to criticise, or to make a point quickly. They’re flexible, punchy, and immediately understood by native speakers.

In contrast, standard chengyu feel more like literary tools. A lot of them are tied to historical stories or Classical Chinese, which means they don’t come naturally unless I’m deliberately trying to sound bookish or formal. Sure, they’re recognised and appreciated, but they’re not what people reach for in everyday conversation. When I hear chengyu in real life, they’re often sprinkled in sparingly, usually in writing or more prepared speech, not spontaneous talk.

So for practical, conversational fluency, my file is just more relevant. It gives me the kind of language that makes people say, “Oh, you really get how we talk.” That matters more to me than impressing someone with a line from a war story.

Breakdown
Rough count
  • Total entries in the file: 3 673.
  • Entries that are exactly four Chinese characters and contain no punctuation (the standard textbook definition of a 成语): 881.
Edit: the spiel about why it’s useful.
 

Attachments

  • Idioms (Not Chengyu) PlecFlash.txt
    53.7 KB · Views: 573
Last edited:

Shun

状元
Hi Bosimao,

thank you very much, these look really nice indeed—taken from real-life language, with a lot of character! For upper intermediate learners, they are definitely a great on-ramp to advanced colloquial Chinese.

Best,

Shun
 

Shun

状元
Hi Joseph,

I couldn't figure out a good way to do it with the BCC frequency list because I couldn't reliably detect either easiness or difficulty with it. Any expression in BCC with a low frequency isn't necessarily difficult, and difficult vocabulary can still have a relatively high frequency.

Maybe you could feed the list to some Chinese LLM and ask it to return all idioms containing easy/basic vocubulary?

Cheers,

Shun
 

josephzizys

Member
So I have filtered all the entries in Bosimao's file for just those with 3 characters, on the theory that most are likely to be guànyòngyǔ, I hope to get these ordered by frequency, but I am not a programmer so it may take me some time! :) In the meantime here is the file for anyone interested;
 

Attachments

  • guànyòngyǔ.txt
    17.4 KB · Views: 6

josephzizys

Member
SO what I am hoping to do with the help of ai, or anyone interested, is to get a list of Chinese you tube channels and then scrape their closed captions for the guànyòngyǔ and thereby order the list.

So I guess my first question is what channels do people recommend for colloquialisms in Chinese? are there any good "street interview" channels or other places that have good content for this sort of thing?
 

Shun

状元
I like the idea of scraping YouTube closed captions.

But: You can't expect others to find YouTube channels for you. You ought to do that yourself.
 
Top