I am currently a PhD student in CS at Georgetown University and working on Natural Language Processing (NLP), advised by Prof. Nathan Schneider.
I am a member of Nathan’s Excellent Research Team (NERT), Corpling led by Amir Zeldes, and GUCL.
I currently focus on datasets and algorithms using computational and linguistic structures for challenges in understanding across languages and scenarios.
Recently, I have been working on things related to Legal Interpretation, Coptic, Discourse, and Metalinguistic NLP.
News
- 2025-10 : Two workshop items at EMNLP 2025.
- On Legal interpretation with LLMs in NLLPW 2025, presented by Junghyun Min. (abstract preprint)
- DeDisCo at the DISRPT 2025 Shared Task in CODI-CRAC 2025, presented virtually by Zhuoxuan Ju. (paper demo)
- 2025-09 : DeDisCo, led by Zhuoxuan Ju and Jingni Wu, and mentor Amir Zeldes won the 2025 DISRPT Shared Task on Discourse Relation Classification across Formalisms. (demo, news)
- 2025-05 : Fortunate to close my first academic year at Georgetown CS with the Outstanding TA Award.
- 2025-03 : It was an honor to be nominated for Graduate Student Teaching Assistant Award in Georgetown University’s 2025 Graduate Student Awards.
Publications
Coverage
-
‘DeDisCo’, our winning in 2025 DISRPT Shared Task on Discourse Relation Classification across Formalisms
By Georgetown Grad Studies, By ‘The Hoya’, By Georgetown Linguistics
Languages
Luke Gessler said that “linguists are often asked how many languages they speak” and now that I work with them a lot, here are mine:
- Kannada (kan)(native, ~B1 level)
- Hindi (hin)(intermediate, ~B1 level)
- English (eng)(professional, ~C1 level),
- Spanish (spa)(a single university course, ~A1 level).
My current language goals are to be able to
- converse and read (a specific novel) in Spanish
- read Konkani (kok) with Kannada script.
- read and work with Coptic (cop).
Similarly, if you ask me how many programming languages I can program in, I will say (like I tell my students)
“it’s not about the number of programming language(s), it’s about the principles”
but then go on to say I have programmed in many languages:
Python (research and tooling), Java (enterprise cloud software), Scala (TAing and side projects, linkedin assessment), C (let’s say structs and pointers), Javascript (up to JQuery, +Angular, Node), Scheme, OCaml (coursework) and Kotlin (graduate software engineering project).
Next Moving Targets: R has priority now over Julia for visualization. More proficiency in OCaml because of Grew.
Writing
I write technical and non-technical blog articles infrequently, you can find them on my medium site here.
Bookshelf
I like to read (largely fiction) and for better or worse track it on Goodreads. If you are interested you can find my shelves here. If you ever meet me, feel free to ask for a personalized recommendation :).
Some lists
I maintain a few (interesting) non-technical lists for fun! I made them separate sites to make space this page compact.
Source
Xiulin Yang’s spiffy site made me move from my old site (archiving in progress…) to this new site built with Rayeren’s acad-homepage. My code is available in Github.