
Double success at AHRC Big Data projects call

Jo Bates and Robert Villa have both been successful in applying to the AHRC's "Digital Transformations in the Arts and Humanities" Big Data projects call.

Jo Bates - The Secret Life of a Weather Datum

The Secret Life of a Weather Datum is a 15-month research project that will explore the socio-cultural values and practices shaping, and being shaped by, the production, collation, distribution and re-use of weather data produced by the UK’s Met Office. To achieve this aim, the project will follow the ‘journey’ of a single weather datum from its production into three cases of re-use: climate science, weather risk markets and citizen science projects. These case studies will comprise interviews, observations, digital ethnography and policy research.

The final outcome of the project will be an interactive website and multimedia research data archive that will allow members of the public to explore this journey in more detail, thus contributing to the public understanding of science.

The project is being developed in collaboration with Dr Yuwei Lin of the University for the Creative Arts, and the infrastructure for the interactive website will be produced by developers at Madlab in Manchester.

Robert Villa - Understanding the Annotation Process: Annotation for Big Data

Big data, by definition, assumes large, rapidly changing, heterogeneous collections of material and new techniques for processing this data. As digital technologies become ubiquitous, data collections in fields such as the humanities, art and culture are growing larger. Many of the technologies associated with “big data” that are used to make sense of these large collections are related to machine learning.

Such techniques are typically based on learning: a smaller training set is created manually and fed to an automatic technique, after which the trained technique can be applied to the full data set. While much scientific effort has gone into improving machine learning and automatic methods of retrieval, less work has gone into the process of creating training sets. The creation of training sets is, ultimately, a human endeavour.
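As a rough illustration of the workflow described above, the following minimal Python sketch trains on a small hand-labelled set and then applies the result to an unlabelled collection. Everything in it is an assumption made for illustration: the example texts, the labels, the `train` and `predict` helpers, and the simple word-count scoring are hypothetical and are not the project's method.

```python
from collections import Counter


def train(labelled):
    """Build one word-count profile per label from (text, label) pairs."""
    profiles = {}
    for text, label in labelled:
        profiles.setdefault(label, Counter()).update(text.lower().split())
    return profiles


def predict(profiles, text):
    """Return the label whose profile best matches the words in the text."""
    words = text.lower().split()
    scores = {
        label: sum(counts[word] for word in words)
        for label, counts in profiles.items()
    }
    return max(scores, key=scores.get)


# Hypothetical hand-labelled training set (the manual annotation step).
training_set = [
    ("heavy rain and strong wind tonight", "weather"),
    ("sunny spells with light rain later", "weather"),
    ("carousel and fairground rides archive", "fairground"),
    ("vintage fairground organ and carousel", "fairground"),
]

# Hypothetical larger collection that nobody has labelled.
collection = [
    "wind and rain expected on the coast",
    "photograph of a carousel at the fairground",
]

model = train(training_set)                           # learn from the small set
labels = [predict(model, doc) for doc in collection]  # apply to the full set
print(labels)
```

Even in this toy version, the quality of the predicted labels depends entirely on the handful of human judgements in `training_set`, which is the step the project sets out to study.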

The project, “Understanding the Annotation Process: Annotation for Big Data”, aims to investigate this less-studied area: how can we create, efficiently and with the least effort, training sets from which automatic techniques can learn? The project has three main research questions:
 - How do human assessors judge and assess text documents, images, and videos?
 - What are the main factors which affect assessor performance (e.g. accuracy, speed, etc.)?
 - What material is easiest for human assessors to judge, and which will also give the best "bang for the buck" when used as input to a machine learning system?

Without training sets, or learning data, automatic machine learning techniques cannot operate, making the creation of training sets a vital component for analysing and understanding big data collections. By learning more about the human process by which people judge material for the purposes of training set creation, we aim to create “best practice” guidelines which can be used by other researchers who require training or evaluation data.

This project is a collaboration with Dr Martin Halvey at Glasgow Caledonian University, along with Dr Jeremy Pickens (Catalyst Repository Systems), the National Fairground Archive at the University of Sheffield, and the British Universities Film & Video Council.

