
    Learning curve parameters.

    <p>The number of characters learned <i>N</i>, final learning efficiency Ξ›<sub><i>f</i></sub>, and integral learning efficiency βŒ©Ξ›βŒͺ for reference cumulative learning costs of <i>C</i><sub>0</sub> = 500 and <i>C</i><sub>0</sub> = 1500. The Yan et al. algorithm was optimized up to a cumulative learning cost of <i>C</i><sub>0</sub> = 4000.</p>

    Learning curves for characters and words.

    <p>The green curves correspond to HSK word lists for levels 1 to 4 (shorter curve) and 1 to 6 (longer curve). The yellow curves correspond to word lists generated from two levels of beginner readers. All curves were created using the OLS character decompositions.</p>

    Measures of learning efficiency.

    <p>The curves <i>A</i> and <i>B</i> represent two different learning curves. For each curve, the final learning efficiency Ξ›<sub><i>f</i></sub> is the cumulative usage frequency for a specific cumulative learning cost <i>C</i><sub>0</sub>, and the integral learning efficiency βŒ©Ξ›βŒͺ is the average cumulative usage frequency between the origin and <i>C</i><sub>0</sub>. Curve <i>A</i> has higher Ξ›<sub><i>f</i></sub> but lower βŒ©Ξ›βŒͺ. Illustrated values for βŒ©Ξ›βŒͺ are approximate.</p>
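    The two measures can be illustrated numerically. In this hypothetical sketch (not the authors' code), a learning curve is a list of cumulative usage frequencies sampled at each unit of cumulative learning cost; Ξ›<sub><i>f</i></sub> is the value at <i>C</i><sub>0</sub> and βŒ©Ξ›βŒͺ is the discrete mean from the origin to <i>C</i><sub>0</sub>:

    ```python
    def final_efficiency(curve, c0):
        # curve[c] = cumulative usage frequency after cumulative learning cost c
        return curve[c0]

    def integral_efficiency(curve, c0):
        # Average cumulative usage frequency between the origin and c0,
        # approximated as a discrete mean over the sampled costs.
        return sum(curve[:c0 + 1]) / (c0 + 1)

    # Two hypothetical curves over costs 0..4: A ends higher, B rises earlier,
    # mirroring the figure (A has higher final but lower integral efficiency).
    A = [0.0, 0.1, 0.2, 0.4, 0.8]
    B = [0.0, 0.3, 0.5, 0.6, 0.7]

    print(final_efficiency(A, 4), final_efficiency(B, 4))        # A is higher
    print(integral_efficiency(A, 4), integral_efficiency(B, 4))  # B is higher
    ```

    The curve values here are invented for illustration; only the qualitative ordering matches the figure.
    
    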

    The first 85 characters of our optimized learning order.

    <p>Taken together, these characters have a cumulative usage frequency of 0.42.</p>

    A network where our algorithm does not return the optimal character order.

    <p>A hypothetical network where the integral learning efficiency of the order generated by the algorithm is lower than that of another possible order. Letters represent Chinese characters (for example, E is a compound character formed from primitives A and B) and the numbers are centralities. Both orders have identical final learning efficiencies.</p>

    Usage frequencies for the first 85 characters.

    <p>The gray, green and blue bars correspond to the black, green and blue curves in Fig 8. Dark bars represent primitives and light bars represent compounds.</p>

    Usage frequency versus number of unique components for the 1000 most common Chinese characters.

    <p>This plot shows the weak relationship between character usage frequency and complexity, the latter represented by the number of unique components used to construct the character. Usage frequency is normalized to 1.0 over the whole usage frequency data set, which encompasses more characters than shown in this plot. The six characters illustrated are the most common in each column. Note that the number of unique components is not the same as visual complexity: the characters ζˆ‘ and θ―΄ have similar visual complexity (they are composed of similar numbers of strokes), but ζˆ‘ is conceptually simpler, being built in the OLS character decomposition from two relatively complex primitive components, 手 and 戈, whereas θ―΄ is built from four.</p>

    Illustration of the topological sort algorithm.

    <p>The ordered list is processed from low to high centrality (right to left in the figure). Once ηš„ is reached, its components are checked in turn. η™½ is found to lie to the right of ηš„ and so is repositioned to its left. Likewise ε‹Ί is found to the right of ηš„ and is similarly repositioned. ε‹Ί is positioned to the right of η™½ because it has lower centrality. The centralities used in this figure are for illustrative purposes only.</p>
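    The repositioning pass described in this caption can be sketched as follows. This is a hypothetical reconstruction, not the published implementation: characters are assumed sorted from high to low centrality, left to right, and each compound's components are assumed listed in decreasing centrality order, so that a lower-centrality component such as ε‹Ί ends up to the right of a higher-centrality one such as η™½.

    ```python
    def reposition(order, components):
        # order: characters sorted by decreasing centrality (left to right).
        # components: maps a compound character to its components, assumed
        # listed in decreasing centrality order.
        # The list is scanned from the low-centrality end; any component found
        # to the right of its compound is moved to immediately before it.
        order = list(order)
        i = len(order) - 1
        while i >= 0:
            for comp in components.get(order[i], []):
                j = order.index(comp)
                if j > i:                          # component lies to the right
                    order.insert(i, order.pop(j))  # move it left of the compound
                    i += 1                         # the compound shifted right
            i -= 1
        return order

    # Hypothetical example echoing the caption ('X' is a placeholder character):
    # both components of ηš„ start to its right and are moved before it.
    print(reposition(["ηš„", "X", "η™½", "ε‹Ί"], {"ηš„": ["η™½", "ε‹Ί"]}))
    ```

    With this input the result is η™½, ε‹Ί, ηš„, X: both components precede the compound, with the lower-centrality ε‹Ί to the right of η™½, as in the figure.
    
    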

    Structural decomposition of the character η…§.

    <p>Primitive characters appear as characters in their own right, whereas primitive components do not. The primitive component 灬 is an abbreviated form of the primitive character 火. The parameter <i>r</i> is the SUBTLEX-CH usage frequency rank of the character. Pronunciations are given in pinyin romanization. Note that each character is assigned only a single meaning even though most actually possess a range of broadly related meanings.</p>

    Measures of character clustering.

    <p>The top panel shows the average distance, in number of characters, to the closest preceding component. The bottom panel shows the average distance, in number of characters, to another character which shares a component. Curves were generated with a fixed cumulative learning cost of <i>C</i><sub>0</sub> = 4000. Averages below 250 characters are not shown because in this region the averages fluctuate wildly.</p>
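    The top-panel measure can be sketched as follows. This is a hypothetical reconstruction of "average distance to the closest preceding component", assuming the learning order is given as a list and a decomposition map gives each compound's components; it is not the authors' code.

    ```python
    def avg_distance_to_preceding_component(order, components):
        # For each compound in the learning order, find the distance (in list
        # positions) back to its closest preceding component, then average over
        # all compounds that have at least one preceding component.
        position = {c: i for i, c in enumerate(order)}
        distances = []
        for i, char in enumerate(order):
            preceding = [i - position[c]
                         for c in components.get(char, [])
                         if c in position and position[c] < i]
            if preceding:
                distances.append(min(preceding))
        return sum(distances) / len(distances)

    # Hypothetical example: ηš„'s closest preceding component is ε‹Ί, one slot back.
    print(avg_distance_to_preceding_component(
        ["η™½", "ε‹Ί", "ηš„"], {"ηš„": ["η™½", "ε‹Ί"]}))
    ```

    The characters and decomposition here are illustrative only; the paper's curves average this quantity over the full optimized orders.
    
    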