Specifying and Optimising Data Wrangling Tasks
Primary supervisor
Additional information
- [1] Sampaio, Sandra; Al-Jubairah, Mashael; Permana, Hapsoro Adi; Sampaio, Pedro. A Conceptual Approach for Supporting Traffic Data Wrangling Tasks. In: The Computer Journal. 2018 (to appear).
- [2] What is Data Wrangling?
Funding
- Self-Funded Students Only
If you have the correct qualifications and access to your own funding, either from your home country or your own finances, your application to work with this supervisor will be considered.
Project description
Data wrangling is "the process of cleaning, structuring and enriching raw data into a desired format for better decision making in less time" [2].
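As a rough illustration of the three steps named in this definition (cleaning, structuring and enriching), the following minimal Python sketch runs them over a handful of invented traffic-sensor records. The record layout, the site codes, the road names, the peak-hour rule and all function names are assumptions made for this example only; they are not taken from the project or from the cited references.

```python
from datetime import datetime

# Hypothetical raw records, e.g. from a traffic sensor feed (invented for illustration).
RAW = [
    {"site": " A1 ", "time": "2018-03-01 08:00", "flow": "120"},
    {"site": "A1", "time": "2018-03-01 08:00", "flow": "120"},  # duplicate once trimmed
    {"site": "B7", "time": "2018-03-01 08:00", "flow": ""},     # missing reading
    {"site": "B7", "time": "2018-03-01 17:30", "flow": "310"},
]

# Hypothetical lookup table used for the enrichment step.
SITE_ROAD = {"A1": "High Street", "B7": "Station Road"}

# Assumed definition of peak hours, chosen only for this example.
PEAK_HOURS = {7, 8, 9, 16, 17, 18}


def clean(rows):
    """Trim whitespace, drop rows with a missing flow value, remove duplicates."""
    seen, out = set(), []
    for row in rows:
        trimmed = {key: value.strip() for key, value in row.items()}
        fingerprint = tuple(sorted(trimmed.items()))
        if trimmed["flow"] and fingerprint not in seen:
            seen.add(fingerprint)
            out.append(trimmed)
    return out


def structure(rows):
    """Convert string fields into typed values (datetime and int)."""
    return [
        {
            "site": r["site"],
            "time": datetime.strptime(r["time"], "%Y-%m-%d %H:%M"),
            "flow": int(r["flow"]),
        }
        for r in rows
    ]


def enrich(rows):
    """Add a road name from the lookup table and a peak-hour flag."""
    return [
        {
            **r,
            "road": SITE_ROAD.get(r["site"], "unknown"),
            "peak": r["time"].hour in PEAK_HOURS,
        }
        for r in rows
    ]


def wrangle(rows):
    """Run the three steps in order: clean, then structure, then enrich."""
    return enrich(structure(clean(rows)))


if __name__ == "__main__":
    for record in wrangle(RAW):
        print(record)
```

Keeping each step as a separate function is one possible design; it makes the steps individually testable, at the cost of passing over the data three times.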
To clean data prior to analytical tasks, a wide variety of data quality techniques and tools are used [1]. These techniques and tools involve a trade-off between flexibility, performance and usability [1]. Highly flexible tools tend to overburden the end user with the need to learn complex application programming interfaces in order to express quality-aware manipulations over the data. The balance lies somewhere on a spectrum between highly flexible and extensible solutions and less flexible but efficient and user-friendly frameworks. In practice, a combination of complementary tools and techniques may be needed in a data quality management project.
This PhD project aims to investigate popular techniques and tools used by data scientists to conduct data wrangling tasks prior to big data analytics, and to develop domain-specific methods and languages.
Person specification
For information
- Candidates must hold a minimum of an upper Second Class UK Honours degree or international equivalent in a relevant science or engineering discipline.
- Candidates must meet the School's minimum English Language requirement.
- Candidates will be expected to comply with the University's policies and practices of equality, diversity and inclusion.
Essential
Applicants will be required to evidence the following skills and qualifications.
- You must be capable of performing at a very high level.
- You must have a self-driven interest in uncovering and solving unknown problems and be able to work hard and creatively without constant supervision.
Desirable
Applicants will be required to evidence the following skills and qualifications.
- You will have good time management.
- You will possess determination (which is often more important than qualifications), although you will need a good amount of both.
General
Applicants will be required to address the following.
- Comment on your transcript/predicted degree marks, outlining both strong and weak points.
- Discuss your final-year undergraduate project work and, if appropriate, your MSc project work.
- How well does your previous study prepare you for undertaking Postgraduate Research?
- Why do you believe you are suitable for doing Postgraduate Research?