Hi, I’m Raghul

I’ll be writing on a range of topics, focussing on ML and probability, documenting what I’ve learnt. I’m about to submit my PhD at Cambridge, and will then be spending 6 months at Amazon. I’m interested in using ML to forecast dynamical systems, especially the weather. Please get in touch with any relevant opportunities! There’s more about me here.

Representation learning for weather data

Representation learning has been hugely beneficial to the process of getting machine learning (ML) to do useful things with text. These useful things include getting better search results from a Google search, and synthesizing images given text prompts (such as done by models like DALL-E). Representation learning is about learning meaningful ways to mathematically represent input data. The power of representation learning is that a single ML model can often extract mathematical representations of an input (e....

April 19, 2024 · 12 min · 2413 words · Raghul Parthipan

The link between causality and invariant predictors

There are a number of reasons we may wish to learn causal mechanisms when modelling a system/forecasting the weather/classifying an image. If our model captures the underlying causal mechanisms, it should be robust to new scenarios (e.g. the future), and it should still produce sensible results if we alter the input (“make an intervention”). Intervening on a system and seeing how things end up helps us make decisions. The issue is that the majority of existing ML tools simply learn correlations....

April 5, 2024 · 14 min · 2906 words · Raghul Parthipan

ML for Numerical Weather Prediction

Recently, there have been a great number of ML models which do Numerical Weather Prediction (NWP) with accuracy similar to state-of-the-art physics-based models. Moreover, these ML models are orders of magnitude quicker for creating forecasts. My intention here is to highlight the parts of these ML models which I think are particularly noteworthy. I will be using the following groupings: Efficiency - this explores what techniques are used in order to work with the large data which represents atmospheric states....

February 15, 2024 · 16 min · 3376 words · Raghul Parthipan

The Kelly Criterion and making bets

I offer you a game: I’ll flip a coin, and you will make a bet. If you guess the result correctly, I’ll give you your bet back plus 200% of the bet. If you guess wrong, I’ll keep your money. We will play this game many, many times. I ask you for a coin, and you sneakily pass me a biased coin with an 80% chance of landing on heads. Given this, the game seems favourable to you, and you sense that this is indeed a game worth playing (after all, if you say “heads”, the probability that you’ll be right is 80%)....

January 8, 2024 · 25 min · 5131 words · Raghul Parthipan

Fitting Models by Maximizing Likelihood

How should we fit models to data? If you look around, some people minimize mean-squared-error. For others, it is the mean-absolute-error which should be reduced. A few may feel inclined towards the Huber loss. The aim of this article is to convince you that when we want to fit a model to some data, it is sensible to do so by maximizing the likelihood which that model assigns to your data....

November 15, 2023 · 7 min · 1304 words · Raghul Parthipan