卷首 Home

今日誦詩 Poem of the Day
大雅期
毛239
旱麓
大雅 · 文王之什
瞻彼旱麓,榛楛濟濟。豈弟君子,干祿豈弟。
瑟彼玉瓚,黃流在中。豈弟君子,福祿攸降。
鳶飛戾天,魚躍于淵。豈弟君子,遐不作人。
清酒既載,騂牡既備。以享以祀,以介景福。
瑟彼柞棫,民所燎矣。豈弟君子,神所勞矣。
莫莫葛藟,施于條枚。豈弟君子,求福不回。

Welcome to Fanya Hanwen Corpus 泛亞漢文語料庫, a Literary Chinese corpus containing over twice as much text as the Siku Quanshu! By taking a pan-Asian, diachronic perspective, the Fanya Hanwen Corpus hopes to propel the Literary Chinese tradition into the modern age.

序 Introduction

The Fanya Hanwen Corpus contains texts from each Chinese dynasty, the Korean peninsula, Japan, Vietnam, the Ryukyu Kingdom, Singapore, and myriad other historical countries and regions. Literary Chinese was and is a living language with a vibrant diversity of texts; in its heyday it enjoyed a status similar to that of Latin, and it should be treated as such. I have been working on this project alone for around a year. It has been a crushing amount of work, attempting to produce a website that meets three goals: (i) develop a region- and culture-first view of Literary Chinese, (ii) develop a modern educational model that works with this view, and (iii) develop an accessible website that operationalises these two goals and casualises the information that comes from them. I do not believe these goals have been met yet, but the project is progressing admirably.

語庫讀器 Corpus Viewer

One of the biggest wins of the project so far has been an elaborate Corpus Viewer, which aims to make the Literary Chinese experience as accessible as can be for everyone. This includes:

字典 Dictionary

This website contains a fairly robust dictionary that aims to be as inclusive as possible. At present, it includes Unihan data, ruby text settings, a wide range of phoneticisation conventions, and the option to switch between Traditional and Simplified Chinese (or the original text, in the case of mixed or variant forms) while viewing a text. The database also contains variant-form mappings drawn from independent research and OpenCC, which aim to cover variants as exhaustively as possible with a view to an eventual text normaliser. Please give feedback on this!
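As a rough illustration of what a variant-based normaliser could look like, here is a minimal Python sketch. The three-entry table and the `normalise` function are hypothetical and hand-picked for demonstration; they are not the corpus's actual data, nor OpenCC's tables or API.

```python
# Hypothetical variant table: each key is a variant form, each value the
# form it is normalised to. Assumption for illustration only; a real
# table would be far larger and would need review for context-dependent cases.
VARIANT_TO_STANDARD = {
    "羣": "群",
    "峯": "峰",
    "爲": "為",
}

# str.maketrans builds a character-to-character lookup table from the dict.
_TABLE = str.maketrans(VARIANT_TO_STANDARD)


def normalise(text: str) -> str:
    """Replace every known variant character; leave all others untouched."""
    return text.translate(_TABLE)


print(normalise("羣峯"))
```

A one-to-one character table like this cannot handle variants whose correct mapping depends on context, which is one reason feedback on the variant data is so valuable.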

The main dictionary definitions, which can appear in popups, come from CC-CEDICT, Unihan, and the Kangxi Dictionary. The Kangxi Dictionary text is processed with regular expressions that insert line breaks at specific wording patterns to make it more readable, and so far this seems to have been successful. The Guangyun and Shuowen Jiezi have also been turned into browsable dictionaries, and their sorting mechanisms have been documented.
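To give a sense of how pattern-based line breaking of this kind can work, here is a minimal Python sketch. The single rule shown (break where a sentence-final 。 is directly followed by 又) is an assumption chosen for illustration, and `BREAK_POINT` and `add_linebreaks` are hypothetical names; the patterns actually used on this site are not reproduced here.

```python
import re

# Hypothetical rule: start a new line wherever a sentence-final 。 is
# immediately followed by 又 ("also"), which often opens a further sense.
# Both halves are zero-width assertions, so no characters are consumed.
BREAK_POINT = re.compile(r"(?<=。)(?=又)")


def add_linebreaks(entry: str) -> str:
    """Insert a newline at every position matched by BREAK_POINT."""
    return BREAK_POINT.sub("\n", entry)


# Made-up sample text for demonstration, not a real dictionary entry.
sample = "甲也。又乙也。又丙也。"
print(add_linebreaks(sample))
```

Because the pattern only asserts context rather than matching text, the entry's characters are never altered; only newlines are added.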

There are hopes to add more dictionaries for comparison, such as the following, which are used in the Siku Quanshu:

Of course, more dictionaries are desired, such as those from non-Chinese nations. The aim is to make immersive learning of Literary Chinese through dictionaries as easy as possible within modern pedagogical frameworks, whilst also promoting scholarly study.

Text Submission

Want to add a new text to the corpus? Submitting one creates a review ticket rather than writing straight into the corpus.

來想 Future Plans

There are hopes to implement a nation atlas, grammar wiki, and more detailed variant/corrected text items over time. Synopses, blurbs, and summaries are also desired.

所載 Publications

I (Llinos Evans) have used this corpus to produce a few publications, some from before the corpus was fully established. They are as follows...

引據 References

The following texts were used in the making of this website, the corpus, and more.