Home
We are a team of Research Data Specialists working to enable data-intensive research across the disciplines.

Our team, experienced in a diverse range of disciplines, hold expertise in:
- Research study design
- Data management
- Data stewardship
- HPC and cloud computing
- Data analysis
- Machine learning
- Computing infrastructure
- Natural language processing
- Image and video processing
- Virtual/augmented reality
- Data visualisation
- Web development
- Community building
- Data collection methods

We invite researchers to engage with us for 3 – 6 month-length projects twice yearly via a formal collaboration process. We are also able to offer consultancy, expert advice, and advocacy to researchers on shorter length projects from time to time. And, if we don’t have the expertise on hand, we can redirect to other research enabler groups and services across the University who will.
We are enabled by the University’s Petascale Campus Initiative (PCI).
CONTACT US:
We encourage researchers to contact us anytime by email: mdap-info@unimelb.edu.au
-
Professor Andrew Turpin
MDAP Director
-
Jo Condon
Communications Specialist
-
Kim Doyle
Research Data Specialist
-
Dr Noel Faux
Senior Research Data Specialist
-
Dr Emily Fitzgerald
Research Data Specialist
-
Jonathan Garber
Research Data Specialist
-
Zaher Joukhadar
Research Data Specialist
-
Dr Aleks Michalewicz
Research Data Specialist
-
Dr Simon Mutch
Senior Research Data Specialist
-
Usha Nattala
Research Data Specialist
-
Priyanka Pillai
Research Data Specialist
-
Dr Mar Quiroga
Research Data Specialist
-
Dr Daniel Russo-Batterham
Research Data Specialist
-
Bobbie Shaban
Research Data Specialist
-
Dr Andrew Siebel
Platform Manager
-
Dr Kristal Spreadborough
Research Data Specialist
-
Dr Edoardo Tescari
Senior Research Data Specialist
-
Karen Thompson
Senior Research Data Specialist
-
Robert Turnbull
Research Data Specialist
-
Geordie Zhang
Research Data Specialist
Engaging with MDAP
We offer two primary streams of engagement:
-
- Collaborations: an allocation scheme for formal collaborative research projects lasting at least three months, with open calls occurring twice yearly. For more information, please review answers to our Frequently Asked Questions.
- Consultancy, expert advice, advocacy and involvement in shorter term research projects. You can email us at mdap-info@unimelb.edu.au to discuss how MDAP can work with you on your collaborative, data-intensive project.
2021 MDAP collaborations
Expressions of interest are now closed for the 2021A round of collaborations. Proposed collaborations should:
- Have an anticipated duration of three to six months;
- Be of strategic value to the University, aligning with its research strategy, increasing future research capabilities, and leading to the development of novel tools and techniques that can be applied to other projects and domains;
- Require expertise and/or infrastructure that cannot be sourced from existing University Services or platforms; and
- Align with available MDAP team capabilities.
Applications will follow a two-stage process, consisting of a short expression of interest followed by a formal proposal. Collaborations will be ranked based on merit by a Resource Allocation Committee made up of academics from a diverse cross-section of University disciplines. The selection criteria used to prioritise collaborations can be found below.
The timeline for this round of EOIs and subsequent applications is:
EOIs open: Monday 26 October 2020
EOIs close: Sunday 15 November 2020, 11:59pm
Applications open: Thursday 26 November 2020
Applications close: Friday 18 December 2020, 11:59pm
If you have any questions, please consult our FAQs and/or contact us at mdap-info@unimelb.edu.au
Evaluation Criteria
Research collaborations will be evaluated on these criteria, on a scale from 0 to 10 (low to high score):
- Strategic value for the University
(for example but not prescriptive: building new transdisciplinary links, strengthening an emerging area, communicating research impact).
- Strategic value for the research group, including capability development
(for example but not prescriptive: training ECRs/staff, gathering pilot data for grant submissions, producing open source software/tools).
- Alignment with MDAP capability.
- Likelihood of producing stated outcomes within timeframe.
2020 Collaborations:
- The creative industries in days of isolation - a fast pace shift to making, learning and living in a crisis
Chief Investigator: Dr Kathryn Coleman (MGSE)
MDAP Collaboration Lead: Dr Kristal Spreadborough
We are witnessing a cultural and pedagogical evolution. The current global crisis triggered by COVID-19 has caused an unprecedented societal shift. This has been starkly epitomised within the arts, creative industries, and education sectors – our first responders to societal change. These sectors have seen rapid shifts in their norms, practices, and pedagogies and have had to creatively adapt to ensure their survival and longevity in an environment mediated by physical isolation, social disruption, and health and economic crises. Drawing on a diverse range of big data sources, this research takes a data-led approach to examine the interplay between data as practice and practice as data. It maps the evolution of norms, practices, and pedagogies within the creative industries in preparation for the “new normal”. - The megalithic jar sites of Laos
Chief Investigator: Dr Louise Shewan (Science)
MDAP Collaboration Lead: Dr Aleks Michalewicz
This interdisciplinary collaboration involves the integration of data generated from the Plain of Jars Archaeological Research Project: UAV acquired imagery, large point clouds of archaeological sites, photographs of more than 1000 megalithic jars, photogrammetry models of burials, analyses of human remains (isotopic, ancient DNA, osteological, synchrotron Micro-CT), artefact conservation and analysis (ceramic, beads, metals), 3D models of material culture, dating analyses, and the geological characterisation of jar and quarry samples. Work with MDAP involves data management and curation, virtual/augmented reality and machine learning capability to create educational/research offerings and to facilitate ongoing archaeological research in inaccessible environments contaminated with unexploded ordnance (UXO), visualisation proficiency to share project outcomes, image and video processing, data analysis, and innovative web development to ensure open access to dynamic and unique datasets. - Causality in Complex Dynamical Systems: Implementing Core Algorithms in a Distributed Computing Environment
Chief Investigator: A/Prof Michael J. Zyphur (FBE)
MDAP Collaboration Lead: Dr Edoardo Tescari
This project will revolutionize the ability of social and physical scientists to characterize and assess causal effects in complex dynamical systems. These systems are difficult to study using traditional methods including experiments and model-based regressions because their assumptions are easily violated by nonlinear system evolution and causal effects. The broad aim of this project is to implement distributed computing methods for analyzing data from complex dynamical systems. The project will implement two core empirical dynamic modeling (EDM) algorithms on a distributed computing platform with a manifold learning method for data imputation and denoising. - Automating image segmentation and morphometric analysis for accelerating neuroscience at the nanoscale
Chief Investigator: Dr Calvin Eiber (MDHS)
MDAP Collaboration Lead: Zaher Joukhadar
The aim of this project is to develop automated image processing tools to identify and measure nerve cell axons in electron microscope (EM) images. We aim to do this in three stages: by implementing existing algorithms for the automatic segmentation and analysis of myelinated axons from conventional transmission EM images, by developing new algorithms to handle unmyelinated axons, and by developing algorithms and data handling processes for the automatic processing of 3D tissue volumes generated by cutting-edge EM techniques like serial block-face scanning electron microscopy (SBF-SEM). The broader goals are to improve our understanding of how the brain communicates with the organs of the body, as well as how different brain regions communicate with one another, in order to support the growing field of bioelectronic medicine. - An Empirical Study of How the US Termination Right Operates (part of the Author’s Interest Project)
Chief Investigator: A/Prof Rebecca Giblin (Law)
MDAP Collaboration Lead: Dr Daniel Russo-Batterham
The Author’s Interest Project is investigating how changes to copyright law could help increase the opportunities authors have to profit from their work, whilst improving public access to knowledge and culture. We’re particularly interested in reversion rights, which see copyrights returned to authors after a certain amount of time or when they are no longer being exploited. More than half the world’s nations have statutory reversion rights, but their effects are woefully under researched. This sub-project is to investigate the operation of a US reversion law that allows authors to terminate copyright grants 35 years after they are made. There have been no in-depth studies investigating how often this scheme is used, by who, and with what effects. Using the publicly available US Copyright Office registration database, the Project intends to fill this gap, contributing rigorous new evidence to debates about what kind of rights can help get authors paid and books read. - Mapping our emotional lives: Building a searchable database of experience sampling data on emotional processes
Chief Investigator: Dr Elise Kalokerinos (MDHS)
MDAP Collaboration Lead: Dr Daniel Russo-Batterham
Experience sampling involves following participants into their everyday lives using smartphones, taking multiple measures in real-time to get a more meaningful picture of psychological processes. Experience sampling methods are central to understanding both normal and disordered emotional responding: they allow us to capture strong and meaningful emotional events that can’t be recreated in the lab, to map dynamic fluctuations of emotions across time, and to study how these processes contribute to psychological well-being. However, conducting experience sampling studies is costly, time-intensive, and requires expertise. In addition, research using these methods generates large, rich datasets, which are often not used to their fullest potential. To reduce the barriers to using experience sampling data, and to harness the reanalysis potential of these datasets, we are building an open-access, searchable, and cumulative database of our existing experience sampling datasets. - Natural Language Historical Mapping
Chief Investigator: Dr Mitchell Harrop (Arts)
MDAP Collaboration Lead: Dr Emily Fitzgerald
The Melbourne History Workshop (melbournehistoryworkshop.com) in the School of Historical and Philosophical Studies is working on an Australian Research Council Linkage Infrastructure, Equipment and Facilities grant on the challenge of Time-Layered Cultural Mapping (TLCMap.org). TLCMap is an open online facility to create maps and timelines and combine new Australian cultural and historical datasets with existing ones. The project is developing infrastructure around the Gazetteer of Historical Australian Placenames (GHAP) API and Recogito (https://recogito.pelagios.org). This includes Named Entity Recognition, time recognition, Sentiment Analysis, and other common text processing techniques. We will be working with MDAP to apply these techniques to existing datasets of Encyclopedia of Melbourne entries (http://www.emelbourne.net.au), historical news stories from the National Library’s Trove (https://trove.nla.gov.au), podcasts (http://mymarvellousmelbourne.net.au), tweets and three photographic collections. The application of these techniques will result in the improvement of the original datasets for use by amateur and professional researchers.
Our methods will be documented in a cookbook-style publication, leveraging the strong background of the MDAP workforce in pedagogy and translating highly technical research methods to an audience that includes both Graduate Researchers and even some amateur historians. - Early Modern Women Translators in Europe (15th-18th c.)
Chief Investigator: Prof Véronique Duché (Arts)
MDAP Collaboration Lead: Bobbie Shaban and Dr Emily Fitzgerald
Building on a previous research project (Histoire des Traductions en Langue Française, XVe- XVIe siècles, ANR project – equivalent to ARC – finished in 2015), chapters published (Duché, 2015, 2016, 2017) and papers given at international conferences (RSA, SRS, 2018), this project will further investigate Early modern Women translators in Europe. It seeks to promote cultural awareness by charting the contours and transnational connections of early modern women translators through the use of a sophisticated, interactive visualisation tool. In particular, the project aims to:- identify the early modern women translators (not only in France, but also in other European countries);
- map the intellectual as well as personal and cultural networks to which these women belong (author, publisher, reader etc.);
- create a repository for metadata on early-modern scholarship.
- Unlocking published metagenomes as a source of information for microbial eukaryotes
Chief Investigator: A/Prof Heroen Verbruggen (Science)
MDAP Collaboration Lead: Dr Mar Quiroga and Bobbie Shaban
Microbial eukaryotes play key roles in earth’s ecosystem and human lives, but knowledge of their biodiversity and evolution are very incomplete. This collaborative project between MDAP, the Faculty of Science, Faculty of Medicine, CSIRO and Royal Botanic Gardens Melbourne aims to bring big data to this field of research, advancing our knowledge of microbial eukaryotic biodiversity and evolution by extracting their organelle genomes (mitochondria & chloroplasts) from publicly available metagenomes. Through this project, we aim to achieve: (1) A fully functioning pipeline for automated download of metagenome reads, their assembly and calculation of contig statistics, (2) A large library of assembled metagenomes generated with this pipeline, permitting testing of alternative strategies for organelle genome binning, (3) Advanced proof of concept for externally funded project proposals based on these ideas. - Development of the PolyMuse Database, to map conservation issues with polymer based plastics in museum collections
Chief Investigator: Dr Petronella Nel (Arts)
MDAP Collaboration Lead: Karen Thompson
PolyMuse is a collaborative Australian Research Council funded project between four universities and six heritage institutions, aiming to develop methods for extending the lifespan of vulnerable polymer based plastics in heritage collections. Data is being collected from different locations and collated into a relational database, with the ultimate aim of aiding analysis and providing access to heritage professionals. This MDAP collaboration will create a "universal-adapter" for bringing data already collected into the database, and to build a streamlined data collection process for future data collection. - Machine Learning and Patterns of Primary Care Utilisation in Cancer Diagnosis and Outcomes
Chief Investigator: Professor Jon Emery (MDHS)
MDAP Collaboration Lead: Zaher Joukhadar and Dr Mar Quiroga
The Victorian Comprehensive Cancer Centre-funded Data-Driven Research Program was the first in Australia to link large-scale primary care and hospital data for the purpose of enabling increased capabilities in health services research. Until now, traditional analytical and statistical methods have been utilised to examine and map patterns in primary care attendances and how these are associated with cancer diagnoses, treatment and outcomes. These are generally based on the existing evidence base to test prevailing hypotheses and apply these to the local context. This project seeks to apply novel analytical and machine learning methods to the large-scale linked data sources in order to generate new hypotheses and further characterise how people with cancer engage with primary care services. Specifically, this project would aim to identify patterns in various aspects of primary cancer attendances prior to a definitive cancer diagnosis (as identified in linked data sources). These could include prescribing, test requests/results, semi-coded fields such as ‘reason for encounter’ and co-morbidities/other conditions captured in primary care management systems.
The project would identify a specific cancer type (i.e. Upper Gastrointestinal) with sufficient linkages between hospital diagnosis and primary care data and apply tools and algorithms to detect patterns present within specific timeframes of diagnosis or treatment. - Evaluation of automated deidentification of hospital and general practice health records
Chief Investigator: Professor Karin Verspoor (MSE) and A/Prof Douglas Boyle (MDHS)
MDAP Collaboration Lead: Dr Simon Mutch and Dr Noel Faux
Text analysis is an emerging field in health data research. Use of textual health data for research purposes has been limited in Australia to date, due to concerns around the personal identifying information they may contain. The aim of this project is to develop, test and evaluate methods to deidentify textual data sets from electronic medical records in both hospital and general practice settings so these data can be securely extracted for further analysis using natural language processing. The project is a collaboration between the School of Computing and Information Systems (Engineering), the Department of General Practice, the School of Population and Global Health, and the Centre for Digital Transformation of Health (MDHS). The results of this project will facilitate larger scale use of clinical textual data for research purposes in Australia and will provide opportunities for future research with world-class collaborators within the University and externally. - Using novel analytic methods to predict and prevent falls and fall-related injuries in older persons with dementia.
Chief Investigator: Professor Gustavo Duque (MDHS)
MDAP Collaboration Lead: Dr Noel Faux
Six million Australians will be diagnosed with dementia in the next 20 years, at a cost of more than $1 trillion. One third of them will die due to traumatic falls, which are, in the majority, predictable and preventable. Although most of the larger longitudinal studies on ageing have included musculoskeletal and cognitive variables that could help to predict and prevent falls in people with dementia, no studies have developed accurate predictive tools to identify those at risk of falling. We propose the development of transdisciplinary links between several major databases including the Canadian Longitudinal Study on Aging (across Canada), the Quebec Longitudinal Study on Nutrition and Aging (Quebec, Canada), the Women’s Healthy Ageing Project (Metropolitan Melbourne), and the Geelong Osteoporosis Study (Geelong, VIC) to combine and compare cognitive and musculoskeletal data from 35,000 participants, and characterise and model the risk for incident falls and fall-related injuries in cognitively impaired and healthy older adults participating in these studies. Our preliminary analyses have demonstrated that Artificial Neural Networks and Factor Mixture Models may improve the performance criteria of fall prediction in older persons compared to classical linear models. We hypothesise that these models could be helpful in the identification of patterns of risk factors for falls and fall-related injuries in older persons with dementia. The results of this strong international collaboration (facilitated by MDAP) will pave the way for the development of new care pathways for people with dementia, including a future application that can be used in clinical practice to screen for fall risk in this population. - Melbourne Pollen: Using Machine Learning to predict pollen counts based on vegetation, weather, land use and geographical data
Chief Investigator: A/Prof Ed Newbigin (Science)
MDAP Collaboration Lead: Usha Nattala
Our vision is to help the millions of Australians affected by allergies to airborne pollen and other allergens by discovering, translating and delivering research solutions to help better manage their condition. Our current immediate project goal is to use advanced machine learning algorithms to produce a novel forecast model based on our extensive pollen databases, citizen science responses and various environmental inputs including satellite weather, landuse and vegetation data. Machine learning techniques make it possible for us to deal with large multivariate datasets; set up forecasting processes that are capable of identifying and extracting patterns in data without explicit instructions; continuously learning and improving as they go along without the need for excessive human intervention - Rural land use classification algorithms for emergency animal disease responses
Chief Investigator: Dr Simon Firestone (FVAS)
MDAP Collaboration Lead: Jonathan Garber
In an emergency animal disease situation, such as an outbreak of African Swine Fever in Australia, accurate classification of current land uses for different types of farming are needed to facilitate an effective response. The aim of this project is to leverage high resolution aerial imagery with state of the art image classification algorithms in order to identify the types of livestock being raised on a farm. Our goal is to build an artificial intelligence pipeline using high resolution imagery stored in mediaflux, and the GPU compute power of the Spartan HPC, in order to predict the land use of a given farm image. Additional GIS layers will be added to the predictive model to improve the accuracy. The output of this collaboration will be used by stakeholders in order to automatically map the geospatial risks and aid in the response to an animal epidemic. - Introducing new analysis modalities to the Stemformatics Stem Cell Atlas
Chief Investigator: Professor Christine Wells and Dr Jarny Choi (MDHS)
MDAP Collaboration Lead: Priyanka Pillai
Stemformatics.org services a local and international community of stem cell researchers, who use the resource to share and interrogate stem cell studies. It is an online portal associated with a database of curated stem cell experiments. The collaboration with MDAP will enhance the capability of the Stemformatics atlas by creating new modalities for analysis and visualisation of data. This involves mining the data to find biologically meaningful features in sample space, developing new visualisation prototypes and query wizards pertinent to these, as well as improving the sample annotations in our databases to enable more powerful data analyses. - Sustainable Energy Advocacy Coalitions
Chief Investigator: Dr Alfonso Martinez Arranz (MSE)
MDAP Collaboration Lead: Kim Doyle
Climate and energy policy is a highly divisive issue in Australian politics; however, beyond partisan caricatures, little is known about the exact nature of the divisions or their boundaries. This project addresses this gap in the scientific understanding of socio-political divisions around energy policy. The project adopts a data-intensive methodology structured in five parts: online data mining, quantitative surveys, semi-structured interviews, Delphi-method survey, and community meetings.
—
2019 COLLABORATIONS
If you would like to view our 2019 collaborations, please step this way.
—
-
Collaborating with Dr Marianne Coleman
In 2020, MDAP collaborated with Dr Marianne Coleman, a new researcher in orthoptics to The University of Melbourne. In this three-part series, Marianne shares a bit about her research and experience working with us.
December 21, 2020 -
Congrats to these two Superstars!
We are so pleased to see our colleagues, Dr Mar Quiroga and Priyanka Pillai become Superstars of STEM, joining sixty other women in a two-year leadership and communication program run by STA.
December 8, 2020 -
Supporting Indigenous governance of Indigenous data
To help celebrate NAIDOC Week 2020, we reflect on one of our recent collaborations with the Kaiela Institute, the Indigenous Data Network (IDN) and Scholarly Services to support Indigenous governance of Indigenous Data.
November 10, 2020 -
Introducing Jonathan
Meet our resident geographer, YouTube star and now Research Data Specialist, Jonathan Garber, who is passionate about knowledge sharing, travel, and helping to solve data and code crimes to address global challenges.
October 29, 2020 -
Introducing Robert
Whether it’s decoding questions about the universe, ancient Arabic texts or monolithic ‘Jars’ in present-day Laos, Robert Turnbull has us covered. Discover his journey from Australia to the Middle East and back again to pursue his life passion in uncovering questions of the past.
September 22, 2020 -
Defining the Third Space
In August, we co-facilitated Defining the Third Space with the Sydney Informatics Hub team: an online event that aimed to explore our data-intensive research support space with the community nationally and abroad.
September 17, 2020 -
Shining in an unusual time
Meet Gus, Lingyu, Lexie, Sophia and Hai Ho, Master of Data Science and Information Technology students who joined the MDAP team in mid-2020, shortly after we all started to settle into remote working life.
September 17, 2020 -
Introducing Noel
Meet Noel, our in-house bioinformatics and genetics specialist with a knack for tackling complex computational challenges in health and biological sciences and enjoying backyard travel in lieu of European getaways for now.
September 3, 2020 -
Shaping the future of data-intensive research
To celebrate our first year at MDAP: a showcase of some of our successful collaborations with researchers across the Faculties, and our ideas on the future of data-intensive research.
July 27, 2020 -
Introducing Aleks
Meet Aleks, our resident archaeologist who brings a passion for uncovering history to help shape the future of research data stewardship at the University and beyond. Here we learn about Aleks' journey to, in, and around her new MDAP 'digs'.
June 9, 2020 -
Third Space: The Depth
Recently we co-organised a virtual event with Sydney Informatics Hub (SIH) titled Third Space: The Depth, the first of hopefully many to explore the data-intensive research support space in different institutions. Here we share presentations from this event and exciting next steps.
June 4, 2020 -
Imagining a building with heart
While we have been self-isolating, the innovative Melbourne Connect precinct has been taking form. At its heart: a major new digital artwork that will live and grow with the building, its occupants, and its environment over time.
May 28, 2020 -
Introducing Zaher
Meet Zaher, one of our talented research data specialists with a passion for turning creative software skills towards boundary-pushing projects at the intersection of art, science, and technology that can benefit humans and animals alike.
May 12, 2020 -
Introducing Andrew
Recently, Dr Andrew Siebel jumped on board as our Platform Manager, helping the team to foster inclusiveness, collaboration, technical innovation, and good spirit from the get-go. Here we learn about his pathway in human and animal biology and a life-long passion for diversity.
April 9, 2020 -
Like interning with Avengers
Recently, MDAP kicked off its internship program with four Master of Data Science students jumping onto real-world research problems with our research data specialist and data steward team. Interns Maria, Rohit, Pratibha and Saket slotted seamlessly into the team...
April 2, 2020 -
Introducing Priyanka
"I don’t think there are any wicked problems on my horizon that can't be fixed." Celebrating #IWD2020 with a Q&A with one of our research data stewards, Priyanka Pillai.
March 5, 2020 -
Data-intensive research for real-world impact
RDA Plenary and C3DIS adapt for online viewing! We thank organisers for their work to enable us to contribute to the new program, and look forward to sharing more video recordings of our presentations here as they become available in 2020.
March 3, 2020 -
Introducing Mar
Introducing Mar, one of our talented research data specialists with a serious knack for problem solving in mathematics, neuroscience, public health, and any data-intensive area!
February 20, 2020