Since the establishment of LIDA in 2014, much has been done to realise the ambitions of our founding members to escalate data analytics research and education rapidly across disciplinary borders. It is encouraging to note that the core activities of the institute are aligned more strongly than ever to the principles expressed in the refreshed Leeds University Strategy to support interdisciplinary research through research centres and institutes (https://spotlight.leeds.ac.uk/strategy-research-and-innovation/index.html), while capitalising fully on digital technology, data and digital approaches (https://spotlight.leeds.ac.uk/strategy/).
Research and Innovation
I am pleased to introduce our world-leading, award-winning, and innovative work in complex modelling, statistical data science, machine learning and AI in our second online LIDA Annual Showcase. As leaders, not followers, in data analytics and AI, we continue seeking to drive change through data across multiple disciplines, working with academics, industry, and organisations around the world to benefit populations on a global scale.
International symposiums conducted online over the past year meant collaborations could continue despite the impacts of Covid-19, meaning our important work encompassing health, societies, and environment has continued despite challenging circumstances.
During the last year LIDA created three data analytics research Communities – Societies, Health and Environment. These build on the University’s strengths and capabilities within these areas and are further supported by LIDA’s strategic ambitions in three research Programmes – Statistical Data Science, Artificial Intelligence and Immersive Technologies (see figure below). Each of the Communities now has a small leadership team of 3-4 academics whose role is to drive the Community. The leadership teams for the Programmes are under development. Finally, over a dozen Special Interest Groups were created that connect academics and end-users with interests in data analytics.
The case studies published below demonstrate in more detail some of the research activity currently being undertaken across these Communities:
Quality of care increased, but patients seeking care decreased. Indirect effects of the COVID-19 pandemic and their impact on cardiovascular patients.
Understanding how COVID-19 lockdowns may have influenced the health-seeking behaviour of heart patients, as well as delivery of care by specialist services, is vitally important. Leeds Institute for Data Analytics researchers have been at the forefront of an academic collaboration to monitor the impact of COVID-19 on emergency cardiac services, morbidity and mortality rates since 2019.
Improving outcomes for children and young people with cancer.
The Yorkshire Specialist Register of Cancer in Children and Young People (YSRCCYP) maintains accurate collection of sociodemographic and clinical data on children, teenagers and young adults with cancer in Yorkshire. With connections to NHS and other datasets, the YSRCCYP informs the decision-making of local clinicians and commissioners with collaborative national and international epidemiological and outcomes research supported with infrastructure, expertise, and an extensive database encompassing over 15,000 square kilometres of Yorkshire and the Humber, with a population of five million.
Changing and saving lives: A multidisciplinary modelling approach to looking at interruptions in care for cardiovascular patients.
The COVID-19 pandemic been an unprecedented global public health crisis. To limit direct impact of the virus, all government and public health organisations had to reorganise healthcare services and institute restrictions on travel including lockdowns. And although they reduced the number of people who died and were hospitalised with Covid, what has become more apparent over the course of the pandemic is that the interruption of care for non-Covid 19 conditions will also lead to significant morbidity and mortality.
Supporting urgent policy needs during a global crisis.
The Local Data Spaces project is a multidisciplinary collaboration between the Consumer Data Research Centre (CDRC), Joint Biosecurity Centre (JBC), the Office for National Statistics (ONS), and ADR UK. It was set up to support local authorities, groups, and stakeholders response to the pandemic using granular, secured data and research-driven analyses. The research has been recognised as innovative, winning the prestigious ONS Research Excellence Project Award 2021 for delivering public good and informed policy decisions.
High-resolution demographic projections using SPENSER.
SPENSER (Synthetic Population Estimation and Scenario Projection Model) is a set of open-source tools for population estimation and projection, funded by the Alan Turing institute. It is a partnership between academic and private sector researchers that introduces a novel data-driven modelling framework to run complex simulation models.
A food secure future is vital for all of us, but food production is facing many challenges, both in the global north and the global south.
Researchers from the University of Leeds are tackling these major issues and working together to drive innovation in climate-smart food systems. The work, using complex machine learning is enabling a greater understanding of crop yields and knowing what is grown where using remote sensing data, and is underpinned by a combination of existing work on crop models, AI technology, and deep learning (a subset of machine learning).
Measuring volcanic deformation from space to enable global monitoring.
Volcanic eruptions are obviously problematic, not only presenting danger to life but also posing significant economic impacts. Some volcanoes have dedicated volcano observatories staffed by experts ranging from seismologists to traditional geologists and people who analyse gas. In some cases, they will be working with satellite remote sensing data. However, about 90% of land volcanoes aren’t really monitored.
Climate models don’t agree on how clouds will change in our future climate, which has huge implications for our ability to predict how much the Earth will warm.
It’s impossible to model every single meter in the atmosphere and so climate models must model the effect of clouds. But studying the clouds by applying self-supervised machine learning directly to the vast amounts of satellite data available provides a unique opportunity to understand cloud formation and therefore improve climate models.
Clinical trials are not only expensive and time-consuming, they also compromise animal welfare and potentially put human volunteers at risk.
A team of researchers is working on computer modelling that provides a viable alternative – a virtual (aka in silico) clinical trial. “Medical devices follow a lengthy evaluation through bench to animal experiments, and then through human trials, with a tiny amount of scientific evidence currently derived from computer modelling and simulation. The cost is ever-increasing, delaying lifesaving benefits to patients, and they are not without fundamental limitations.
Access to cardiac MR imaging centres is limited in developing countries, scanners being expensive and difficult to acquire. Even in developed nations like the UK, there is a limited number of cardiac MR imaging centres and waiting lists are long.
A multidisciplinary study, led by Dr Nishant Ravikumar, Lecturer in Computer Science, and Prof. Alejandro Frangi who holds the Diamond Jubilee Chair in Computational Medicine, at the University of Leeds, looked at whether an alternative, more cost-effective, and scalable solution could be found to detect indicators of cardiac disease, especially of Myocardial Infarction (MI).
Revolutionising cycle policy, planning and investment with the Propensity to Cycle Tool.
A collaborative, multi-disciplinary and multi-institution project initiated by Dr Robin Lovelace, Asst. Prof. Transport Data Science at the University of Leeds, the Propensity to Cycle Tool (PCT) is revolutionising strategic cycle planning in England and Wales and beyond.
The Propensity to Cycle Tool was developed by team from the Universities of Leeds, Cambridge, Westminster and LSHTM with funding from Department for Transport (DfT) in the UK and the ESRC Consumer Data Research Centre (CDRC).
LIDA Data Scientist Internship Programme
LIDA’s Data Scientist Internship Programme offers early career data scientists the opportunity to hone their expertise working on real-world data and topical research questions, whilst giving academics and LIDA’s partners the chance to harness new talent in finding novel ways to solve data problems. Since its inception in 2016, it has provided 52 paid interns with hands-on experience and ownership of projects drawn from the fields of social science, health and engineering, using data science for public good.
Centres for Doctoral Training
LIDA hosts the Centre for Doctoral Training Data Analytics and Society and the Centre for Doctoral Training for Artificial Intelligence for Medical Diagnosis and Care.
The ESRC-funded CDT for Data Analytics and Society provides postgraduate research training across the Universities of Leeds, Liverpool, Manchester and Sheffield.
To read more about the activities and successes over the last year, please click the link below.
The CDT AI for Medical Diagnosis and Care is now in its third year. The Centre, the grant for which is based in Computing, has found an excellent home in LIDA, where its students benefit from the opportunity to build a truly interdisciplinary network of highly talented researchers. The focus of the CDT is on AI in cancer care.
LIDA Affiliated Research Centres
LIDA hosts and supports a number of research centres across a portfolio of research, education and training.
In 2014, with substantial investment from the ESRC, the Consumer Data Research Centre (CDRC) was established to lead academic engagement between industry and the social sciences and utilise consumer data for academic research purposes.
Click here to read their Research Review which looks to demonstrate what can be achieved through the analysis of consumer data in multiple contexts.
Immersive technologies enable practitioners, policymakers and communities to visualise and interact with data in a meaningful manner. This is why LIDA Societies and the Centre for Immersive Technologies have combined forces to create an ‘Integrated Data Analytics Unit’ that brings together experts in data analytics and visualisation across Leeds and Bradford. The Integrated Data Analytics Unit is now leading a Regional effort to tackle inequality known as Digitally Acting Together As One (DATA 1).
Education and Training
In my first six months as Education & Training Director it has been fascinating to learn about the breadth and depth of LIDAs activities. It is obvious that all staff & students involved in education & training have responded, and continue to respond, with great ingenuity and innovation to the rapid changes brought about by the pandemic and have exciting plans for the future.
Click the link below to hear more from our new Director for Education & Training Professor Paul Baxter.
Research and Technology
LIDA launched a cutting-edge platform called LASER (the Leeds Analytic Secure Environment for Research) in April 2021 (https://forstaff.leeds.ac.uk/news/article/7468/powering-data-analytics-research-education-and-training). LASER is a Microsoft cloud-based platform that meets the highest standards of data security while providing increased capacity and flexibility to meet the diverse data analytics needs of researchers and students at the University. The project is the most complex IT project that the University has undertaken to date, a true collaboration between IT, LIDA and external suppliers underpinned by a £1.5 million strategic investment by the University.
We believe in innovation through collaboration and work closely with a wide range of data partners in the public, private and third sectors to ensure the Institute’s academic expertise is channelled towards addressing real-world challenges.
LIDA is currently working with a number of key partners. Please click the links below to read more about these collaborations.
The Alan Turing Institute is the UK’s national institute for Data Science and Artificial Intelligence. In recognition of its world class academic research, the University of Leeds has been a partner of the Institute since 2018. Led by the Leeds Institute for Data Analytics (LIDA), the partnership benefits from contributions from researchers and data scientists across all University faculties as well as from LIDA’s many established research connections with external partners in business, government and the third sector.
As one of the country’s largest and most renowned civic universities, the University of Leeds has always had a strong mission for impact and partnership with its city region. This potential has also been recognised in LIDA since the outset, for example through joint PhD projects, data science interns and post-doctoral research on major programmes such as Quanticode and the CDRC. For a time, this interaction was accelerated further by co-location of a substantial group of analysts from Leeds City Council on Worsley Level 11.
DATA-CAN is incredibly proud of its partnership with the Leeds Institute for Data Analytics (LIDA) – a relationship which has grown and developed since Health Data Research UK launched the eight initial data hubs in 2019.
As the UK’s health data research hub for cancer, our aim is to make high quality health data more accessible for cancer researchers and health professionals, to help improve cancer services and patient outcomes. LIDA was one of DATA-CAN’s founding partners and has been at the forefront of DATA-CAN’s work over the last year.
The LIDA-Sainsbury’s partnership continues to go from strength to strength, with research outputs emerging in both strategic priority areas: Property Analytics and Health.
In May 2021 we announced a new partnership with the IGD (Institute of Grocery Distribution) and their Healthy and Sustainable Diets Project group, involving 20 partners from food retail and manufacturing. We have worked together to identify key metrics to evaluate the success of pilot trials to encourage healthier and more sustainable diets, using the Eatwell Guide as a model.