top of page

Jessica Witte

Digital Research Analyst

University of Edinburgh

WEBSITES

Institutional

CONNECT

EMAIL TRANSP.png
orcid-icon-sm.png
Linkedin_circle_black-512.png
pngimg.com - github_PNG40.png

RESEARCH INTERESTS

PROJECT

THISTLE: Technology for Handling and Improving Stubborn Text-Level Errors

THISTLE explores how large language models (LLMs) can be used to improve the quality and accessibility of digitised historical newspapers. Focusing on a nineteenth-century subset of digitised issues of The Scotsman, this project will develop and test a pilot pipeline that uses fine-tuned LLMs to correct complex OCR errors in historical texts. The aim is to produce cleaner, more reliable machine-readable text that is suitable for computational analysis.

Digitised image of the front page of the first issue of The Scotsman newspaper; Public domain CC0

DISKAH Logo - Horizontal - Total Black.png

This website has been produced and is managed by the coordinators of the DISKAH project at the University of Brighton. The ‘Digital Skills in Arts and Humanities (DISKAH): Transforming Access to Digital Infrastructure and Skills‘ project has been funded by UKRI (Grant No. APP4595).

DISKAH builds on the previous projects of the Digital Skills Network in the Arts and Humanities, which received funding by the ​​​​​​AHRC under the ‘Embed digital skills in arts and humanities research scheme‘, aiming at addressing the digital skills gap within the arts and humanities research community.

University of Brighton | Cockcroft 402 | BN2 4GJ | Brighton

Network 
Facilitators

Follow us

  • bluesky-black-round-circle-logo-24460-transp
  • LinkedIn
University_of_Brighton_logo_edited.png
UAL_Lockup_LCC_BLACK.png
Durham University Logo_ 100_BLACK%.png
exeter-logo (1).png

Funded by

UKRI-Logo_Horiz-RGB.png

© 2025 by DISKAH Network. Powered and secured by Wix. Designed by Raffaella Losito.

bottom of page