In this project extension, we are evaluating the applicability of Machine Learning (ML) to automating these tasks, aiming to improve on our current ranking performance. On a hold-out test set of 105 advertisements, the average percentage of swapped pairs is 21.22 for a system that uses the advertisement text alone and a promising 9.42 for one that enriches the text with skill highlights. In particular, we are focusing on the human-in-the-loop approach of active ML and on deep/transfer ML to maximise processing correctness whilst minimising the amount of training data. The concrete goal is to engineer an online visualisation system that predicts the ranking and highlighting; quantifies which PhD skills are the most sought-after; characterises job advertising that seeks PhD skills in Australia in terms of geographic location, industry sector, job title, working hours, continuity, and wage; and uses interactive visual feedback to train both people and machines to improve their inter- and intra-annotator agreement.
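To make the ranking measure concrete, the following is a minimal Python sketch of one common way to compute a swapped pairs percentage. It assumes the measure is the share of item pairs whose relative order in the predicted ranking contradicts the reference ranking; the exact definition used in the project may differ (for example, in how ties are handled). The function name and the example scores are hypothetical and for illustration only.

```python
from itertools import combinations


def swapped_pairs_percentage(gold_scores, predicted_scores):
    """Percentage of item pairs ordered differently by the two score lists.

    Assumption: pairs tied in the gold scores are skipped, and pairs tied
    only in the predicted scores are not counted as swapped.
    """
    if len(gold_scores) != len(predicted_scores):
        raise ValueError("Both score lists must have the same length.")

    comparable = 0  # pairs with a defined order in the gold ranking
    swapped = 0     # comparable pairs whose predicted order is reversed

    for i, j in combinations(range(len(gold_scores)), 2):
        gold_diff = gold_scores[i] - gold_scores[j]
        if gold_diff == 0:
            continue  # a gold tie carries no ordering information
        comparable += 1
        predicted_diff = predicted_scores[i] - predicted_scores[j]
        if gold_diff * predicted_diff < 0:
            swapped += 1  # opposite signs mean the pair is swapped

    return 100.0 * swapped / comparable if comparable else 0.0


# Hypothetical relevance scores for four advertisements (higher is better).
gold = [3, 2, 1, 0]
predicted = [3, 1, 2, 0]  # the two middle advertisements are swapped
print(round(swapped_pairs_percentage(gold, predicted), 2))  # prints 16.67
```

Under this definition, 0 corresponds to a ranking that agrees with the reference on every comparable pair and 100 to a fully reversed one. An average over a test set would be obtained by applying the function to each ranking and taking the mean.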
This project will appeal to students with excellent skills in experimentation, programming, and teamwork. Preference is given to students who have finished or are taking the units of Artificial Intelligence, Document Analysis, and/or Machine Learning at The ANU, or similar units elsewhere.
This student project is part of the activities of the NLP Team within the ML Group at The Australian National University (ANU) and Data61 in Canberra, the capital of Australia. The OECD Regional Well-Being Report 2014 evaluated Canberra as the most livable city in the world.
The ML Group was ranked in 2014 among the top five in the world in ML, the others being Microsoft Research, Max Planck Institute Tübingen, University of California, Berkeley, and University of Cambridge. According to the QS World University Rankings for 2015-16, The ANU ranks within the top 20 universities globally with an overall score of 91.0 out of 100.0 (19th), whilst the next best Australian university scored 83.1 (42nd). For the field of research (FOR) code of AI and Image Processing, which applies to ML and NLP under Information and Computer Sciences, The ANU obtained the top score of 5 out of 5 in the Excellence in Research for Australia (ERA) evaluations in both 2010 and 2012.
The NLP Team is experienced in developing powerful low-cost techniques to convert free-form text into structured representations. Our deep and transfer ML methods are able to use fewer than a hundred expert-annotated sentences to achieve performance comparable to state-of-the-art systems initialised with ten times more data. Similarly, our language processing methods have been among the top performers in the ALTA, CLEF, and TREC shared tasks on automated understanding, use, summarisation, and translation in the difficult genres of “Doctors’ Latin” in electronic health records and “Lawyers’ French” in patents.
Active Learning, Artificial Intelligence (AI), Big Data, Data Analytics, Deep Learning, Machine Learning (ML), Natural Language Processing (NLP), Transfer Learning, Visual-Interactive Text Search and Exploration