Big Humanities Data: Workshop Program

“Big Humanities Data”
The Third 2015 IEEE Big Humanities Data Workshop

Thursday October 29, 2015

Hyatt Regency Santa Clara
5101 Great America Parkway
Santa Clara, CA 95054 USA

Mark Hedges — Richard Marciano

8:00-8:05 Welcome (35 participants)


8:05-10:05 First Session (2 hours)

THEMES: data mining 

  1. Mining Local Gazetteers of Literary Chinese with CRF and Pattern based Methods for Biographical Information in Chinese History (Chao-Lin LIU, Chih-Kai Huang, Hongsu Wang, and Peter Bol: National Chengchi U. (Taiwan) / Harvard U. (USA))
  2. Metaphor Mining in Historical German Novels: An Unsupervised Learning Approach (Stefan PERNES: U. Würzburg (Germany))
  3. The Coding of Literary Form: Data Mining and the Information Structure of Historical Texts (Dallas LIDDLE: Augsburg (USA))
  4. Plot Arceology: a Vector-Space Model of Narrative Structure (Benjamin SCHMIDT: Northeastern U. (USA))
  5. Scaling Out for Extreme Scale Corpus Data (Matthew COOLE, Paul Rayson, and John Mariani: Lancaster U. (UK))
  6. A Method for Cross-Document Narrative Alignment of a Two-Hundred-Sixty-Million Word Corpus (Ben MILLER, Jennifer Olive, Shakthidhar Gopavaram, Yanjun ZHAO, Ayush Shrestha, and Cynthia Berger: Georgia State U. (USA) / Indiana U. (USA) / Troy U. (USA) / AirSage)

10:05-10:20 Coffee Break

10:20-11:20 Second Session (1 hour)

THEMES: Twitter, social data

  1. Predicting Social Trends from Non-photographic Images on Twitter (Mehrdad YAZDANI: U. California San Diego (USA) and Lev Manovich: CUNY (USA))
  2. Towards a Mobile Social Data Commons (Giles GREENWAY, Leonard Mack, Tobias Blanke, Mark Cote, and Tom Heath: King’s College London (UK) / Open Data Institute (UK))
  3. Mixed-Initiative Social Media Analytics at the World Bank: Observations of Citizen Sentiment in Twitter Data to Explore “Trust” of Political Actors and State Institutions and its Relationship to Social Protest (Nadya Calderon, Brian Fisher, Jeff Hemsley, Billy Cesavich, Greg Jansen, Richard Marciano, and Victoria LEMIEUX: Simon Fraser U. (CA) / Syracuse U. (USA) / U. Maryland (USA) / The World Bank & The U. of British Columbia (CA))

11:20-12:05 Panel Session with Stanford: CESTA, Big Data and Digital Humanities

Chair: Mark Hedges, King’s College London


Stanford Panelists: CESTA, Text Technologies, Libraries


  • Gabriel Wolfenstein: CESTA — “Big Data at CESTA”
  • Elaine Treharne and Celena Allen: Stanford Text Technologies & CESTA
  • Benjamin Albritton: Stanford U. Libraries