A week at the International Conference on Machine Learning (ICML)

Five NICD data scientists enjoying a day of sightseeing in Washington, D.C.

The 39th International Conference on Machine Learning


ICML (aka the International Conference on Machine Learning) is one of the largest annual gatherings of machine learning practitioners and features cutting-edge research on all aspects of machine learning including AI, statistics, and data science. Talks cover research from areas including machine vision, computational biology, interpretable ML, topology, generative models, and from highly theoretical work to novel applications and everything in between.


A team of data scientists from the National Innovation Centre for Data, including myself (Dr Hollie Johnson), attended the thirty-ninth ICML in July this year, held in Baltimore, Maryland. With our diverse backgrounds in statistics, machine learning, computer vision, visualisation and environmental modelling, we each went to ICML hoping to discover new research to guide the development of our own individual skills and to pass on to the organisations we work with.  


The main conference days included longer twenty-minute talks and short five-minute pitches within themed sessions, as well as keynote talks. On the final two days of the conference, there was also a selection of full-day workshops on a variety of topics. These tended to include longer talks and posters on a theme and provided a great opportunity to really deep-dive into a topic. We also attended the expo, where global companies such as Google, Meta and AstraZeneca were talking about some of their research.


In two of the keynote talks, Regina Barzilay and
Aviv Regev, both from the Massachusetts Institute of Technology (MIT), discussed ways in which machine learning could help solve highly complex problems in the life sciences sector – particularly those relating to drug discovery and validation. It is evident that there is a rapidly growing body of research on ML for life sciences, with many of the research questions being driven by real clinical needs, and drawing on research on graph neural networks in particular.

The harbour in Baltimore, Maryland

A new type of hybrid conference


In light of ongoing concerns about Covid-19, calls to make conferences more accessible to wider audiences, and the argument that we should be trying to reduce international travel, ICML should be commended for their efforts to hybridise the conference this year. That said, whilst well thought-out for remote attendees, in reality the hybrid approach resulted in a strange conference experience for those of us attending in person. For example, all speakers were required to pre-record their talks for remote audiences and, unfortunately, many speakers chose to play their video rather than speak live despite being physically in attendance at the conference. We even observed a few using automated text-to-speech! A great demonstration of the technology, but not a very engaging audience experience.


A small number of workshops took a different approach, insisting that all speakers presented in real time (whether in-person or via Zoom), with talks also recorded for remote audiences. The difference this made to the level of engagement in the room was significant and the experience was overwhelmingly more enjoyable. However with this approach, the potential for technical issues is of course increased, so there is clearly some way to go in finding the right balance. We are interested to see how conferences evolve in the coming years to address the ever-changing needs of attendees.

Exhibition stands at ICML


Thoughts from the NICD team


Other members of the team have given their personal highlights below:


Hollie: "The workshop on topology, algebra and geometry in machine learning was a great session. All the talks were live and everyone in the workshop had interesting research and ideas to share. Some of the research was related to my PhD in statistical topology, and it was nice to step back into the topology world for a day and see how those ideas could be used in machine learning."


Mac: “I was delighted to observe a growing research interest in Interpretability and Uncertainty Quantification of machine learning models. These two research topics are imperative for reliable data-driven decision-making.”


Matt: "My highlight of ICML was the workshop on Distribution-Free Uncertainty Quantification. Methods in this field of study can generate confidence intervals for ML model predictions. Unlike some previous methods, these do not depend on assumptions about the ML model or the data distribution. I think applications with high trust requirements (e.g., healthcare) are going to benefit hugely."


Fergus: "I particularly enjoyed a workshop on Data-Centric AI, which focuses on improving data used to train machine learning models. Andrew Ng's keynote about the Data-Centric AI Competition told an engaging story about some innovative ideas that the teams developed and why we need a better way of sharing datasets."


John: "At this year’s ICML I was delighted to see how many sessions focussed on the very real problems of interpreting and validating ML models. I was particularly impressed with GAM Changer - a tool which allows domain experts to tweak the learned functions in additive models."


On our day off, we all visited Washington, D.C., for some well-earned touristing!

Five NICD data scientists enjoying some sightseeing in Washington DC



Dr Hollie Johnson is a Data Scientist at the National Innovation Centre for Data whose technical background is rooted primarily in statistics and machine learning. Her PhD focussed on topological event history analysis for global climate data and she has previously worked in software development roles.

To find out more about working with us, get in touch.
We'd love to hear from you.