Vocabulary size of Polish speakers

Czytaj po polsku

We present preliminary results from a Polish vocabulary test, designed for both native speakers and learners. Detailed information about the test methodology is available on the test description page.

Participants

To date, 2045 respondents have completed the test, including 1121 native speakers and 924 Polish learners. Let's look at their age distributions:

Histogram of age of Polish vocabulary test participants, native speakers only Histogram of age of Polish vocabulary test participants, learners only

Both groups are dominated by younger participants in high school to college age range, since we promoted the test mostly among them.

Vocabulary size

Let's now look at vocabulary sizes of native speakers and learners.

Histogram of vocabulary size of Polish vocabulary test participants, native speakers only

For native speakers:

These statistics combine results from native speakers of all ages. Later, we’ll explore how vocabulary size varies with age.

Histogram of vocabulary size of Polish vocabulary test participants, learners only

For learners:

These numbers reflect the vocabulary sizes of learners across all age groups combined.

Here is a comparison of native speakers and learners on the same plot:

Histogram of vocabulary size of Polish vocabulary test participants, overlapped native speakers and learners

Vocabulary size and age

Let's now dig little deeper and look into how vocabulary depends on age. On the next plot each point represents an individual participant:

Vocabulary size vs age for Polish vocabulary test participants, native speakers and learners

The data look a bit overwelming. Let's group participants by age and do some statistics so we can see trends. Let's start with native speakers.

Statistics on vocabulary size vs age for Polish vocabulary test participants, native speakers only

This is a box plot, where each box correspond to a group of participants of certain age (5-9, 10-14, 15-19, etc). Middle line of each box shows a median for the group, bottom line - 25th percentile, top line - 75th percentile; the box covers half of all participants within the group. Whiskers show 1.5x of interquartile range. Data outside of the whiskers is often considered outliers. Individual observations are shown as points.

Plotting data this way immediately allows us to see a trend. For native speakers, vocabulary size grows with age. It grows rapidly up to approximately 25 years (the period of formal education), than keeps going up slowly for the rest of life. Similar studies often report vocabulary decrease after around 55 years old; we do not see this effect. Here are the numbers:

Let's transition to learners.

Statistics on vocabulary size vs age for Polish vocabulary test participants, learners only

For learners:

Frequently asked questions

Can I compare these results with results of other vocabulary tests in Polish?

Unfortunately, you can't. It is impossible to compare results of any two vocabulary tests. First, all tests use different methodologies, so they measure slightly different aspects of one's vocabulary. Second, all tests define what is counted as a word differently. For example, some count derivative words, and some do not. Third, every test uses different definition of what it means to "know" a word. Finally, not all tests in the internet are created equal. Only a small fraction of online vocabulary tests are based on rigorous scientific methods.

How does avarege vocabulary of native speakers and learners in Polish compares to other languages?

Comparing vocabulary sizes across languages is nearly impossible.

How many words are in Polish language?

It is very difficult to determine the exact number of words in any language, because the estimates depend strongly on what is considered an independent word and what is considered a derivative. We use the PWN Dictionary of the Polish Language, which contains 140,000 words.


logo small

Last updated: February 2nd, 2025