Cameron D. Campbell 康文林

Family, Social Mobility, and Inequality in China and in Comparative Perspective


Paper on nominative linkage in the CGED-Q in Historical Life Course Studies

Posted on September 1, 2022 (updated September 9, 2022) by camecamp

Our paper “Nominative Linkage of Records of Officials in the China Government Employee Dataset-Qing (CGED-Q)”, which shares our experience with nominative linkage in the CGED-Q, has been published in Historical Life Course Studies. We hope it will be useful to others who are engaged in large-scale, automated nominative linkage (disambiguation) of individuals in historical Chinese-language sources.

While the approach that we arrived at after many iterations may be specific to the CGED-Q and its contents, we think that our summary of the challenges that we encountered will be of broader interest, and our methods should at least be a roadmap for others with related projects.

Major issues we document and then address include the use of variant orthographies for the same character in different editions or sources, replacement of characters with ones that look similar but are actually completely different, replacement of characters with homophones, inconsistencies in the writing of the names of counties, and changes in boundaries that led the same county to be associated with different provinces in different sources or editions.
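To make the first of these issues concrete, here is a minimal sketch of folding variant character forms into one canonical form before comparing records. It is only an illustration and not the procedure used in the paper: the `VARIANTS` table, the `normalize` and `match_key` helpers, and the sample records are all hypothetical, and a real table would be far larger. Homophone and look-alike substitutions are not handled here and would need their own treatment.

```python
# Hypothetical, illustrative variant table: variant form -> canonical form.
# A real project would need a much larger, carefully curated table.
VARIANTS = {
    "甯": "寧",
    "寕": "寧",
    "禄": "祿",
    "髙": "高",
}


def normalize(text: str) -> str:
    """Replace each known variant character with its canonical form."""
    return "".join(VARIANTS.get(ch, ch) for ch in text.strip())


def match_key(surname: str, given_name: str, province: str, county: str) -> tuple:
    """Build a comparison key from the four primary attributes."""
    return tuple(normalize(field) for field in (surname, given_name, province, county))


# Two made-up records that differ only in variant orthography.
record_a = match_key("髙", "福禄", "江蘇", "江甯")
record_b = match_key("高", "福祿", "江蘇", "江寧")

print(record_a == record_b)
```

Under these assumptions the two keys compare equal, so the records would become candidates for linkage, with other attributes left to adjudicate whether they really refer to the same person.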

Here is the abstract:

We introduce our approach to the nominative linkage of records of Qing officials who were included in the China Government Employee Datasets-Qing (CGED-Q) Jinshenlu (JSL) and Examination Records (ER). We constructed these datasets by transcription of quarterly rosters of civil and military officials produced by the government and by commercial presses, and records of examination degree holders. We assess each of the primary attributes available in the original sources in terms of their usefulness for disambiguation, focusing on their diversity and potential for inconsistent recording. For officials who were not affiliated with the Eight Banners, these primary attributes include surname, given name, and province and county of origin. For the small subset of officials who were affiliated with the Bannermen, we assess the available data separately. We also assess secondary attributes available in the data that may be useful for adjudicating candidate matches. We then describe the approach that we developed that addresses the issues we identified with the primary and secondary attributes. The issues we have identified and the approach that we have developed will be of interest to researchers engaged in similar efforts to construct and link datasets based on elite males in historical China.

Here is the paper at the HLCS website.

We have also made available the complete tabulations that are the basis of the tables in the paper. These include the frequencies of surnames and given names in the CGED-Q JSL, and the frequencies of discordance across records of the same individual in the recording of surnames, characters in given names, and place of origin. The tabulations can be downloaded from the HKUST DataSpace and the Harvard Dataverse:

https://dataspace.hkust.edu.hk/dataset.xhtml?persistentId=doi:10.14711/dataset/M8HQEA

https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/4OSP8V

Those not specifically interested in linkage may still be interested in the tabulations of surnames and characters in given names.
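For readers who want to produce similar tabulations from their own linked data, the following is a rough sketch of counting surname frequencies and flagging individuals whose linked records disagree on the surname. The `(person_id, surname)` layout and the sample rows are hypothetical and do not reflect the format of the downloadable files.

```python
from collections import Counter, defaultdict

# Hypothetical linked records: (person_id, surname as recorded in one edition).
records = [
    ("p1", "王"),
    ("p1", "王"),
    ("p2", "李"),
    ("p2", "季"),  # look-alike character recorded in another edition
    ("p3", "王"),
]

# Frequency of each surname form across all records.
surname_freq = Counter(surname for _, surname in records)

# Collect the distinct surname forms recorded for each linked individual.
forms_by_person = defaultdict(set)
for person_id, surname in records:
    forms_by_person[person_id].add(surname)

# Individuals whose records disagree on the surname.
discordant = sorted(pid for pid, forms in forms_by_person.items() if len(forms) > 1)

print(surname_freq.most_common())
print(discordant)
```

The same pattern could be applied to characters in given names or to place of origin by swapping the field being compared.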
