Home » Uncategorised » Page 19

Using novel types of data to detect illness caused by contaminated food or drink

CDRC PhD Student Rachel Oldroyd is one of the UK Data Service Data Impact Fellows. Rachel is a quantitative human geographer based at the Consumer Data Research Centre (CDRC)at the University of Leeds, and here discusses how novel types of data are used to detect illness caused by contaminated food or drink.

Affecting an estimated 1 million people at a cost of around £1.5 billion per year (Food Standards Agency, 2011), foodborne illness remains an unacceptably high burden on the UK population and economy. As many victims choose to recover at home without visiting their GP, the number of cases is difficult to measure and severely under reported in national data.

But what is foodborne illness? The World Health Organisation defines it as an Infectious Intestinal Disease caused by the ingestion of a harmful parasite, virus or bacteria, known as a pathogen. A pathogen can infiltrate any part of the food supply chain and can be hard to detect, but will result in symptoms ranging from mild nausea to death. With around 500 annual deaths in the UK attributed to food poisoning, the Food Standards Agency (FSA) are continually developing methods to support their key objective to reduce its incidence.

I currently teach Geographic Information Systems in the School of Geography and I’m studying towards a PhD in the Consumer Data Research Centre, both at the University of Leeds. My research is focused around data analytics for food safety. In particular, exploring the landscape of food safety in the UK and investigating the utility of novel types of data. For the first part of my research I plan to extract geo-demographic variables from the 2011 Census, investigating relationships between these variables and food safety measures (restaurant hygiene scores, hospital admissions and mortality). The second part of the research will focus on the use of novel types of data and Natural Language Processing to detect cases of foodborne illness reported through online reviews and social media. It is hoped that these datasets may provide additional information missed by the traditional GP reporting process.

Many US studies have researched the use of novel types of data for disease detection, reporting timeliness and the inclusion of additional information as key advantages compared to traditional GP data. For example, in a restaurant review, customers may comment on the cleanliness of the restaurant, the quality of the service and/or describe the food they ate. These user-generated comments are extremely useful and are not available from traditional data sources. However, extracting reviews within which customers report illnesses can be difficult. It is not as simple as looking for specific keyword matches, as these will often return false results; for example searching for ‘sick’ may return ‘I’ve never known anyone get sick here’. This is where Natural Language Processing plays its part. A model can be trained to identify sequences of words which refer to illness and return relevant reviews; ignoring those which do not indicate illness.

It’s hoped that this research will have a strong policy impact and will be used to inform and improve the current restaurant inspection process in the UK. Throughout the research I plan to liaise with key industry professionals, including those from the FSA and the local authority to keep the research relevant and timely. I’m delighted to be named as one of the UK Data Service Data Impact Fellows and plan to take full advantage of the scholarship by developing impact through presentations at national and international conferences, disseminating the research through suitable publications and holding stakeholder events and public seminars. Watch this space!

GIS in Japan

Our Deputy Director, Professor Alex Singleton, dedicated a week to running a workshop at Ritsumeikan University, Kyoto, Japan.

Covering the basics of Geographic Data Science (GIS) and Urban Analytics with particular emphasis upon the study of cities through new forms of data, the final day included presentations of independent student projects that focused upon a range of different application areas. Winning projects were awarded with CDRC branded prizes, including a printed 2017 calendar featuring top maps from our mapping portal, maps.cdrc.ac.uk.

Alex’s workshop was received extremely well and proved to be an useful illustration of code based teaching of GIS  to new users of R.

Course Materials.

Alex Singleton along with his co-presenter Dr Daniel Arribas-Bel.

The CDRC and UK Data Service have enlisted Dr Daniel Arribas-Bel to host a webinar titled ‘Bringing new forms of data to the study of cities’. For more and to book a place.

 

Training Review – Smartphone Application Development for Data Collection Purposes

 

I recently completed a half-day course at LIDA on Smartphone Application Development for Data Collection purposes, the course was perfectly pitched so that those without a computer science background (most of us) were able to jump straight in. The course was introductory in nature but with enough expertise to give a sense of what you can expect down the line should you be able to develop your own native application. The course itself covered the different types apps from the more technical native apps to some of the more user-friendly app building tools. The technical details such as coding languages and the software packages you’ll need were explained and we were even given brief demonstrations of.

The course was invaluable for giving me the understanding how these applications could be be used in research. The course covered the important aspects of how data could be collected and stored through smartphone applications, and importantly what considerations you need to make given the different functions an app can have. This level of information and detail allowed me to think about how apps could be used in my own research which is looking at how incentives could be used in travel behavior. To this end it has developed a line of thinking in my research methods which might not have been possible without the opportunity to attend this course.

Connor Walsh, PhD Student, University of Leeds

We are currently confirming the details for the next Smartphone Application Development for Data Collection Purposes course.  To register your interest, please contact [email protected]

Analysing conversations on Twitter: Do e-petitions help to increase public engagement with politics?

 

CDRC Data Science Intern, Molly Asher, is currently conducting a pilot study on the effectiveness of the UK Parliament’s e-petitions system with the goal to determine whether e-petitions have led to a rise in trust in parliament, the development of new engagement with parliament and/or policy changes.

The full project will develop a comprehensive analysis of a parliamentary e-petitioning system, which integrates the views of petitioners, the actual processes dealing with petitions, the public debate surrounding the petitions, the views from policy-makers and the outcomes of petitions. The project adopts a mixed-methods and inter-disciplinary approach, incl. Data Science analysis of all petitions (incl. themes, success probability, geographical representation, timing of signatures, nature of the petition & partisanship) and the petition debates on Twitter (incl. themes, sentiment analysis, nature of the debate, e.g. the level of polarization and partisanship, geographical representation, social network analysis of those involved in the debate).

Molly is currently conducting a pilot study on the petition debates on Twitter and potentially linking them in an analysis of the e-petition data.  She presented her findings at a recent LIDA Seminar:

 

Data

Molly is currently using the following data for the pilot study project:

  • Twitter data collected via the Twitter Streaming API
  • UK parliament e-petition where JSON data files are available for download with each petition.

 

Oxford Retail Futures Conference

On 6 December 2016 the CDRC co-hosted the annual Oxford Retail Futures Conference at the Said Business School (SBS), University of Oxford. The focus of the event was to explore the current uses and challenges of using consumer data in the retail and supply chain environment, and to discuss developments of data analytics for a range of industry and research purposes.

The day consisted of a series of well received presentations and lengthy panel discussions concerning developments in data acquisition and analysis, understanding areas and activities using new forms of data and the implications of big data for the organisational aspects of firms. Interesting discussion surrounded future directions for the CDRC’s SmartStreetSensor project, the prospects of loyalty card data and using big data to inform organisational decision making.

Attendees and speakers included industry experts representing the likes of Walgreens Boots Alliance, the Local Data Company, GI Solutions Group and pgw Ltd. The day was brought to a close by Martin Squires (Global Lead, Customer Intelligence and Data) from Walgreens Boots Alliance, giving an insightful overview of a practitioner’s viewpoint of the uses of their customer loyalty card data and future directions for data analytics.

CDRC co-investigator awarded a Gold Medal

Professor Michael Batty, co-investigator of the Consumer Data Research Centre (CDRC), was awarded with the Royal Town Planning Institute’s (RTPI) highest honour – the Gold Medal.

The Gold Medal is only awarded once every 2 years to individuals who have made an outstanding contribution to the field of planning.

Mike was presented with the Gold Medal on 8 November by the RTPI Vice President, Stephen Wilkinson, after delivering the annual Nathaniel Lichfield Lecture.  Mike’s lecture was entitled “The Planning Balance Sheet 60 Years On: Evaluation in the Digital Age”. Click here for a downloadable version of the lecture.

Big Data Week 2016

We hosted and participated in a series of events for Big Data Week 2016 (24-29 October). Events varied from public exhibitions to lectures and were attended by a range of local, national and international attendees.

Big Data Here
CDRC UCL kicked off their week with ‘Big Data Here’, a week long exhibition of live data providing an alternative, digital view of the physical space around UCL. Data sources included transit feeds from Transport for London (TfL) and demographic information from the Office for National Statistics (ONS). Throughout the week a different live data feed projected through the window of UCL’s North Lodge, giving passer by’s an opportunity to view the visualisations culminating from the feeds.

The most popular big screen visualisation proved to be the near live traffic camera feeds from TfL. Members of the public were free to interact with the exhibition each afternoon with a CDRC team member on hand to provide a more detailed overview of the concept behind Big Data Here. This saw an excellent turnout of local, national and international attendees keen to learn more about the exhibits.

Big Data Here was led by Oliver O’Brien, creator of CDRC Maps. Read his summary of Big Data Here.

Big Data for Marketers
The CDRC team, along with a number of students from our MSc Consumer Analytics and Marketing Strategy course, participated in the ‘Big Data for Marketers’ event in Leeds.  Hosted by Alscient, the city sponsor for Leeds Big Data Week, the event focused on the importance of Big Data in Marketing, taking stock of current analytical practices in the industry and looking at ways to improve in the future.  A common theme in the presentations and subsequent discussion was the need for graduates to have solid analytics skills and an understanding of Big Data when entering the industry.

24 Hour Climathon – How can we deliver domestic carbon reduction in an age of austerity?
CDRC students took part in a 24 hour Climathon event in Leeds which aimed to develop new solutions to reduce carbon emissions  The event, held by the Priestley International Centre for Climate, enabled students and stakeholders in Leeds to join citizens in world cities such as Paris and Shanghai simultaneously taking part in the hackathon-style Climathon event.  In Leeds, 28 participants took part from 12 noon on 28 October until noon the following day and focused on an energy efficiency challenge set by Leeds City Council. Read more about the event.

Empty Housing Innovation Lab
The final event of the week took place at the University of Liverpool in London, in conjunction with Student Data Labs. The event, titled ‘Empty Housing Innovation Lab’, took an interactive approach to tackling the housing crisis in the UK, using big data. The event had an excellent turnout of students from local universities including London School of Economics (LSE), Imperial College and UCL . CDRC project manager Sarah Sheppard provided an opening talk, with emphasis on some of the data sources already available to students (and others) via our datastore that can be utilised for research; this was followed by Anastasia Ushakova, CDRC Phd Student, providing a lightning talk on her own research on energy consumption and how this ties in with the housing crisis. A range of interactive activities followed, with the R workshop proving extremely popular amongst attendees.

So you want to be a data scientist?
The momentum continued beyond Big Data Week in Leeds, with the ‘So you want to be a data scientist?’ event falling on the 2nd November.  Organised by Alscient, and attended by students from Leeds Data Science Society and the MSc Consumer Analytics and Marketing Strategy course, the session offered practical advice and mentoring from industry based data scientists, as well as the opportunity to take part in a mini analytics challenge.

Masters Research Dissertation Programme 2017: Call for Industry Projects

Following another successful year of the Consumer Data Research Centre (CDRC) Masters Research Dissertation Programme, we are now seeking proposals from businesses for new projects due to commence in spring 2017.

The CDRC are aiming to open the application process for masters students in January and to therefore encourage applicants from a wider breadth of disciplines and institutions. The application process will be facilitated on the CDRC website and businesses are encouraged to interview/select successful applicants in early 2017.

This presents a great opportunity to get a Masters student to help you to make progress with:

  • Major current issues, such as multi-channel marketing, customer insight, store networks, transport, surveys, social media, brand insight, predictive modelling and many others.
  • ‘horizon scanning’ projects that are not of the highest priority in the day-to-day work schedules
  • Working with data – both your own in-house data, and also maximising the value to be obtained from Open Data or data from government or administrative sources (available through the CDRC)

It also publicises your company’s interest in students with data skills. Previous experience illustrates the main features of the initiative:

If you have a project in mind, please complete a project proposal form 2017 and email it to Guy Lansley at [email protected] by the start of December 2016.

All proposals need to be approved before they can be publicised to ensure that the students can maximise the academic potential from their dissertations. Alternatively, feel free to email Guy informally if you would rather discuss rough ideas for projects at this stage.

For more details please visit the Information for Retailers page.

Founder and President of Esri at UCL

The Consumer Data Research Centre (CDRC) are pleased to announce that Jack Dangermond, Founder and President of Esri, will be visiting University College London (UCL) and giving a talk on his vision for how GIS and Geography can help address the global challenges we face today to shape the future for our world. Esri, founded in 1969, are a global leader in GIS software sales and consulting, operating across various public, private and third sectors. Esri’s solutions encompass not only desktop GIS but are continually expanding in web and mobile GIS applications, enabling new applications of GIS in research and beyond.

The talk will take place on 9 November 2016 at 4:15pm at UCL (Pearson Building G22) with a short presentation from key UCL research groups (CDRC, SpaceTimeLab, ExCiteS, CASA) on some of the high profile projects that Esri have helped to support, followed by Jack’s presentation on Esri’s vision as a company. Afterwards, a small reception, sponsored by Esri, will be held in the Pearson Building G07, where people can meet Esri staff and UCL students to continue discussions on future possibilities with GIS.

Places are limited so please book early and if you find you are unable to attend please let us know so your place can be given to someone else.

To make a booking click here.