Skip to content

Conversation

@weissenh
Copy link
Contributor

@weissenh weissenh commented Nov 19, 2025

TL;DR there was only one author page (strangely with a suffix -tu) containing 63 papers. Five different authors have been identified and got their own id/author page. There are still six papers remaining in the catch-all. The original page with -tu suffix is no longer valid (could choose between catch-all chao-zhang, -pku, -cambridge, -uiuc, -ustc, -zju)

❓ TODO: why is the preview still displaying non-empty site for -tu while I cannot find any instance of it in the code anymore nor any link to it from other preview site? Is it because it was initially there, before I changed it to be -pku ? Is the build is not deleting sites/links?

(Please replace this text with a description of the changes effected by this pull request.
Include a link to the corresponding Github Issue, if there is one.
Details on how to do this (can be found here).)

Closes #5162

Related: #3243

Status Quo

In XML found 4 ORCIDs:

Changes

Needs to sum to 63 papers

Catch-all remaining, author affiliations:

  • chengdu (2009)
  • baidu (2013, 2022)
  • toshiba europe (2025, 2024-semantic)
  • peking (2024-eagle)

Call to action for issue submitters

History of GitHub issues

Evidence for each author / further links

Click below triangle to (un)collapse

Click here to uncollapse

Cambridge one ("cz277"):

PKU / Tsinghua one:

UIUC / Georgia one:

  • ORCID: 0000-0003-3009-598X : says Georgia employee, 61 works including some ACL ones
  • Homepage: http://chaozhang.org/ : lists publications and also links to Google Scholar
  • OpenReview: https://openreview.net/profile?id=~Chao_Zhang15 : mentions Homepage, ORCID, Google Scholar, DBLP, PhD at UIUC since 2019 at Georgia, 142 publications
  • in XML 11 times with ORCID, many more without, but with affiliation Georgia in XML often
  • https://preview.aclanthology.org/author-page-chao-zhang2/people/chao-zhang-uiuc/ (36 papers)
    • the 6 from 2025 all have orcid in xml
    • another 5 papers from 2024 come with ORCID in XML too: findings and emnlp ones
    • consistent affiliation and email address in PDFs at gatech, aside from 2 papers in 2023 emnlp findings ( 2023.findings-emnlp.798 "Improving Consistency..." , 2023.findings-emnlp.542 "Knowledge-Selective...") with Amazon affiliation, which are found on his website along with his other publications.

ZJU Zhejiang one:

USTC one:

@weissenh weissenh added this to the Author page backlog milestone Nov 19, 2025
@weissenh weissenh self-assigned this Nov 19, 2025
matching his own website. The person is now at Tsinghua (which is just next to Peking University), OpenReview reports PhD at "Peking University, Tsinghua University (pku.edu.cn)"
@weissenh weissenh marked this pull request as ready for review November 25, 2025 19:09
@mbollmann
Copy link
Member

TL;DR there was only one author page (strangely with a suffix -tu) containing 63 papers.

From looking at 44afb955b, it seems the intention at the time was to disambiguate the Chao Zhang from Tsinghua University from others by creating an ID for them and assigning it to two papers, but without also creating a catch-all ID this did absolutely nothing. The commit also did the same for Yifan Peng, Kexin Wang, and Weiwei Sun — I haven’t checked if those also have open issues already.

❓ TODO: why is the preview still displaying non-empty site for -tu while I cannot find any instance of it in the code anymore nor any link to it from other preview site? Is it because it was initially there, before I changed it to be -pku ? Is the build is not deleting sites/links?

The preview branches are not deleting files on new builds. I’ve stumbled over this a few times.

Copy link
Member

@mbollmann mbollmann left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I spot-checked a few and this LGTM, but I would personally opt not to change the existing ID chao-zhang-tu as it’s already linked from the author’s profiles. If I understand correctly, the abbreviation refers to the author’s current affiliation rather than the highest-degree institution, but I think of this convention more as a convention for creating new IDs, not for forcibly changing existing ones. @mjpost @nscheid thoughts?

@weissenh
Copy link
Contributor Author

I would personally opt not to change the existing ID chao-zhang-tu as it’s already linked from the author’s profiles. If I understand correctly, the abbreviation refers to the author’s current affiliation rather than the highest-degree institution, but I think of this convention more as a convention for creating new IDs, not for forcibly changing existing ones.

I understand this point. I will change it back if this is the consensus.

If that wasn't clear enough from my notes above - there are at least two persons who seem to be at Tsinghua this year: -cambridge and -pku. The former was the first to ask and how the -tu page came about initially, the latter asked later (issue still open) and on OpenReview lists Tsinghua&Peking University for PhD / on personal website Peking University as PhD institution.

I remember the reason to use degree institution is so that people don't ask to change their data with every affiliation change, but I guess it was more about the comment field not the URL. Would anyone ask us to change their author page URL because they are no longer at suffix institution or because they obtained a higher degree from another institution?
Btw -tu is not a very "unique" suffix anyway.

Let me know how I should proceed.

@weissenh
Copy link
Contributor Author

The commit also did the same for Yifan Peng, Kexin Wang, and Weiwei Sun — I haven’t checked if those also have open issues already.

Good point about other authors:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Author Page: chao-zhang-tu

3 participants