Data Scientist Internship Programme
LIDA launched the Data Scientist Internship Programme in September 2016, as part of our commitment to developing data science capability and driving multidisciplinary data analytics research. The eleven appointed interns are currently working alongside researchers from across the University on a variety of projects.
The programme provides the opportunity for interns to:
- Own delivery of a data science project and get hands-on experience using real data
- Establish links with project partners and work to provide solutions to real-world challenges
- Build skills and knowledge in advanced analytics
- Participate in on-site training opportunities in statistical analysis, visualisation, research methods and computer programming
- Work alongside leading scholars as part of the LIDA team and gain valuable work experience
‘I completed a LIDA Internship working with Sky Betting & Gaming to develop models to identify problem gamblers. My time at LIDA was essential to my transition from academia to industry. Working in collaboration with academic and industry partners allowed me to develop my understanding of business challenges, while maintaining a solid academic grounding. Having the support of academic supervisors helped ensure my work was rigorous, an ethos I have carried over into my current role as a Data Scientist at Sky Betting & Gaming.’
Ed Berry, Data Scientist – Sky Betting & Gaming
Early economic modelling and budget impact analysis of Prolaris® test to aid the treatment management decisions in prostate cancer patients
Identification of Clinical Factors Associated with Poor Surgical Outcomes in a Primary Care Dataset
Does fabric tactility affect clothing product sales?
An area classification of consumer vulnerability in the UK
Application of natural language processing for identification of online hate on Twitter
Assessing the effectiveness of the e-petition procedure through Twitter conversations
Catch! – real time simulation of daily travel patterns
Intern Profiles 2018/19
Alex Coleman
Education:
BSc (Hons) Molecular Biology, University of Liverpool
PhD Molecular Virology, University of Leeds (Submitted)
How I became interested in data:
During my PhD, one aspect of my project was applying a technique that took a snapshot of proteins within a cell and how those proteins changed over the course of a viral infection. In order to process the large data sets that came out of these experiments, I learnt R and Python and developed a series of R scripts to help rapidly and reproducibly identify key hits for further experimentation. I also applied these skills to volunteer work with my local Labour party, through which I’ve developed and implemented a volunteer mobilisation app in R using Shiny (www.mynearestleedsmarginal.com).
Skills and experience I hope to gain from the internship:
I’m keen to use the internship as an opportunity to formalise my skill set in ‘R’ and ‘Python’ and acquire best practice experience building code and machine learning models. I’d also like to explore a variety of different topics outside my previous experience to learn about developing real-world solutions with data. Finally, I am looking forward to developing a broader understanding of the philosophical and ethical implications of data analytics at LIDA and helping to communicate this to stakeholders, policy makers and the general public.
Where I hope to be in three years:
I would aspire to be applying the skills I’ve acquired at LIDA in a role invested in enhancing the public good through data science in either the public or private sector.
Gonzalo Cruz
Education:
MSci in Physics from the University of Nottingham
How I became interested in data:
I have always enjoyed science and mathematics, but the link to data became more obvious at university. Throughout my degree, I had to use a variety of data to model real scenarios. I realised that statistical analysis, data visualisation and gaining insight from seemingly chaotic data were what appealed to me most. I recently came across the idea of Industry 4.0: the great potential capabilities in A.I. and machine learning created by the vast availability of data. The constant technological advancements, in addition to the variety of fields you can contribute to, are what really motivate me to work in Data Analytics.
Skills and experience I hope to gain from the internship:
The most important factor for me is personal development. At LIDA I am surrounded by people from a huge diversity of backgrounds, working on a plethora of projects, so there is always something new to learn. On top of this, the access to training both within the centre and through university lectures means I can complement my knowledge and get a solid basis for whatever new concepts I encounter in my projects. I hope to strengthen my coding skills in Python and learn how to visualise data in a more efficient manner, perhaps learning new software. I am looking forward to taking ownership of a project so that I can improve my decision-making and communication skills. At the end of the day, I want to gain a particular set of skills that I can use in my next position and help propel my career forwards.
Where I hope to be in three years:
I would like to be working as a Data Scientist, eventually becoming more client-facing and doing some consulting.
Ivana Kocánová
Education:
BSc in Banking & Finance Including Year Abroad from the University of Essex
MSc in Business Analytics from the University of Nottingham
How I became interested in data:
Through my bachelor’s course I gained fundamental knowledge of business. However, during this time, I realised that a practical way to achieve operational efficiency and improved decision-making is through a deeper understanding of data. My interest in these areas was first sparked during my year abroad, where I had the chance to study modules that integrated Databases, and Systems Analysis and Design. During my Master’s I learned to code in Python, write complex queries in SQL, visualise data in Tableau and apply various machine learning algorithms. What I appreciate most about data science is its variety and its potential use in any field.
Skills and experience I hope to gain from the internship:
I am hoping to gain experience with different types of datasets and projects. I would like to continue deepening my knowledge of Python, SQL and Tableau and learn R. I am also looking forward to learning more about spatial analytics. Furthermore, this internship offers a unique opportunity to work within a group of experienced researchers and leading scholars.
Where I hope to be in three years:
Working as a Data Scientist in either industry or academia.
Jack Lewis
Education:
BSc in Geography from Sheffield Hallam University
MSc in Geographical Information Systems
How I became interested in data:
I first became interested in data through exploring my passion for mapping in my master’s degree. Maps provide a platform that makes it easier to visually understand the volume of data often used to create an image; however, I subsequently wanted to develop an understanding of the way in which we process data. I managed to get involved with the Masters Research Dissertation Scheme under the CDRC, used SQL to explore retail centre attractiveness using GPS data, and fell in love with data analysis.
Skills and experience I hope to gain from the internship:
I feel my skills stem from a geographical perspective, so I really want to explore different techniques for data analysis. So far I have enjoyed using pioneering and novel methods, and I want to discover what techniques are being used at the forefront of research here at LIDA. My knowledge of coding languages is limited, and I feel the opportunity presented to me by this internship is perfect for developing the skills necessary for industry or future research. I feel I will benefit most from the variety of different training courses offered here at LIDA, which will help me to explore the range of different disciplines data science is useful for.
Where I hope to be in three years:
In three years’ time I hope to be a competent data scientist, though I am currently unsure what area of expertise I would like to work in. So far I have enjoyed exploring retail, though the variety of different subjects data science is useful for excites me, as I will be working on different projects.
Jodie England
Education:
BSc in Genetics from the University of Leeds
MSc in Molecular Medicine
How I became interested in data:
As a geneticist I have experience handling large, complex datasets from patient and public genome databases, and identifying trends in gene variant expression patterns within groups of individuals that suffer from genetic disorders. Further bioinformatics and statistical analysis skills gained from my Undergraduate and Postgraduate degrees have given me a keen interest in improving my data science skills for application in further research.
Skills and experience I hope to gain from the internship:
I hope to become more knowledgeable in programming languages such as R and Python, and utilise these in the handling of large datasets in order to identify trends and visualise data in a competent manner. Through handling a wide range of data types that are new to me, I hope to develop transferrable data analysis skills that I can then apply to research in the future. I also hope to develop my presentation and communication skills aimed at a wider audience, and network within LIDA in order to connect with industry and academic contacts.
Where I hope to be in three years:
I hope to undertake a PhD and further academic research, and I am open to exploring opportunities in new academic fields or areas of research.
Kevin Minors
Education:
MMath degree in Mathematics at University of Oxford
PhD in Mathematics at University of Bath
How I became interested in data:
One of my first experiences with data occurred during my PhD. I was working on a project that aimed to model the movement dynamics of a fish population. This multidisciplinary research included working with ecologists to design experiments that allowed us to track the movement of the fish. Analysing this tracking data first sparked my interest in data. Once I discovered the field of data science and machine learning, I was hooked!
Skills and experience I hope to gain from the internship:
Improved applied programming in Python, R, and SQL would be an ideal skill to take away from this internship. I hope as well to spend a significant amount of time working with academics and industry partners here at LIDA to gain experience tackling data science and machine learning problems from a variety of perspectives. In addition, I am looking forward to getting a better understanding of the similarities and differences between academia and industry and the role data science plays in both.
Where I hope to be in three years:
Working as a data scientist in a challenging and engaging environment. I will have significant experience collecting data from numerous sources, cleaning and processing data, training a variety of machine learning tools, and creating beautiful and informative visualisations to generate actionable insights.
Luke Archer
Education:
BSc Biomedical Sciences from the University of Hull
MSc Bioinformatics and Systems Biology from the University of Manchester
How I became interested in data:
I wrote a literature review in the final year of my undergraduate on Next Generation Sequencing in Clinical Microbiology and how it compared to current techniques. It was fairly obvious that high throughput techniques were orders of magnitude better than their predecessors, and the ability to gain massive amounts of data about a pathogen in a relatively short space of time could transform epidemiology and outbreak monitoring. My interest spread into other forms of biological data, and so the logical next step was to pursue Bioinformatics for my MSc.
Skills and experience I hope to gain from the internship:
I hope to branch out from my biological background and experience the real-world application of data science in business and urban analytics. I want to expose myself to brand new types of data and analysis, and get an idea of how the skills I learn during the internship can be applied in different industries and avenues of research. In particular, I want to develop certain skills that will be crucial to a career in data science, such as project management, quickly learning new programming languages and technical skills, and being able to communicate my work to everyone from experts in the field through to the public at large. Finally, I would like to build a strong and varied network both within LIDA and outside, in academia and industry. I hope that my work in this internship will lead me to an enjoyable and successful career in data science.
Where I hope to be in three years:
Working as a Data Scientist in industry, possibly in a large city in the UK, but I would particularly like to be working abroad somewhere like America or Australia.
Robert Clay
Education:
MMath in Mathematics and Statistics from the University of Manchester
How I became interested in data:
My interest in data stems from a strong passion for sport and hockey in particular. Metrics have long been developed and argued between professionals and fans alike to help transform player data into who is the ‘best’. This has led me into a degree in which applied statistics was prevalent and exposed me to a wide variety of different datasets. This ranged from particular types of data such as time series and longitudinal data to a project investigating new methods of regression analysis and their implementation in R to complex data.
Skills and experience I hope to gain from the internship:
I hope to use this opportunity to explore beyond my mostly theoretical degree and apply myself to real situations with the potential to improve quality of life or service. I aim to expand my R knowledge further, working on my ability to present information visually in figures and graphs. I also want to vastly expand my current Python skills and use them to implement more impressive data science techniques, including machine learning. I wish to gain experience networking and collaborating while working within LIDA, to benefit me both socially and in making useful contacts for the future.
Where I hope to be in three years:
In three years I would hope to continue working on long-term data science projects, whether they be in industry or academia. I would be particularly interested in moving abroad, to North America or Asia.
Stelios Theophanous
Education:
BSc in Biomedical Science from the University of Sheffield
MSc in Bioinformatics from the University of Manchester
How I became interested in data:
I became interested in data during my Master’s degree, where I learned how to code in Python, R and Java. For my dissertation, I applied my programming skills to analyse real-world data and tackle problems in the fields of cancer and genetics. I have since been fascinated by the growing power of data across various disciplines and discovered that data analysis can help drive powerful change in the world.
Skills and experience I hope to gain from the internship:
I hope that through this internship I will further improve a range of technical skills, including project management, technical writing and programming. Moreover, I would like to work in a multidisciplinary setting, exchanging ideas with colleagues from different backgrounds. This will help me overcome the various challenges I will face throughout the internship allowing me to deliver a successful project, expand my professional network and gain a broader perspective of data science in general. I would also like to use the internship as an opportunity to present to a wider audience and enhance my communication skills.
Where I hope to be in three years:
I hope to be studying for a PhD, carrying out an exciting project in the field of cancer informatics.
Rizwana Uddin
Education:
BSc in Biochemistry from the University of Huddersfield
How I became interested in data:
My first experiences of data analytics were through lectures at university that taught the use of the R programming language. During my dissertation project I used ‘R’ in combination with SPSS, which I thoroughly enjoyed and which led me to explore further the ways that data can be used in medical bioinformatics in addition to healthcare. I was fascinated by the ability to predict outcomes based on data collected in fields such as predictive medicine.
Skills and experiences I hope to gain from this internship:
I hope to further my knowledge of ‘R’ and gain more experience with handling big data. I also hope to develop skills in using languages such as ‘Python’ and ‘SQL’, which I have no previous experience with. Moreover, I would like to use the opportunity of working within a wide multi-disciplinary team to gain experience of sharing information with both academic personnel and the general public, as well as to gain valuable contacts.
Where I see myself in three years’ time:
Working as a data scientist in medical bioinformatics.
Benjamin Wilson
Education:
BSc in Computing from Leeds Beckett University
How I became interested in data:
My initial intrigue in data analytics started with reading about new methods in the life sciences and learning how computing has become more important to many forms of research. I realised that I wanted to take a more altruistic approach to my computing career because of this.
I wondered how I might apply my computing skills to work in research and development of assistive technologies. This made me pick up a data science project for the final year of my undergraduate degree. This was an exciting and challenging adventure. I looked at various datasets, from neuro-images to climate data. I eventually set the scope to a binary classification project to predict malignancy in tumours, to try to grasp the fundamentals and potential of how data may be used today and in the future within the life sciences.
Skills and experience I hope to gain from the internship:
I hope that by the end of this programme, I will be able to start my own projects in full confidence, having developed my research skills for a cross-disciplinary career. I wish to understand the contextual areas of data science so that I can better understand what fields I enjoy working in. I also hope to develop my skills in mathematics and statistics.
Where I hope to be in three years:
I would like to be working on the cutting edge of research, perhaps developing tools and methods for data scientists.