Learning Machines – A blog about data, science, and learning machines

Learning Data Science: Why a High R^2 Can Be Misleading

A high $R^2$ can make a regression model look impressively accurate — but this number can be deceptive. If you want to understand why a high $R^2$ is not always a sign of a good model, read on!
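One common way a high $R^2$ can mislead is a model with the wrong functional form. The following is a minimal sketch under that assumption, not necessarily the argument the full post makes: the data (`y = x^2` for `x = 1..10`) and the helper `r_squared_linear` are made up for illustration. A straight line fitted to clearly curved data still reaches an $R^2$ of roughly 0.95, while the residuals show a systematic U-shaped pattern.

```python
def r_squared_linear(xs, ys):
    """Fit y = a + b*x by ordinary least squares; return (R^2, residuals)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    syy = sum((y - mean_y) ** 2 for y in ys)  # total sum of squares
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    residuals = [y - (intercept + slope * x) for x, y in zip(xs, ys)]
    ss_res = sum(r ** 2 for r in residuals)  # residual sum of squares
    return 1.0 - ss_res / syy, residuals


# Hypothetical data: a purely quadratic relationship, no noise at all
xs = list(range(1, 11))
ys = [x ** 2 for x in xs]

r2, residuals = r_squared_linear(xs, ys)
print(f"R^2 of the straight-line fit: {r2:.3f}")
print("Residuals:", [round(r, 1) for r in residuals])
```

The residuals are positive at both ends and negative in the middle, so the line is systematically wrong even though $R^2$ looks impressive. Plotting residuals is a cheap check that the summary statistic alone does not provide.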


The Magic of In-Context Learning (ICL): When Your Model Already Knows Your Data

Have you ever looked at a freshly plotted scatter plot and immediately thought, “Ah, this is clearly a logarithmic curve with some heteroskedastic noise,” without running a single line of modeling code? How do you do that? You don’t perform gradient descent in your head. You use your intuition!

Building Your Own Mini-ChatGPT with R: From Markov Chains to Transformers!

Remember our journey so far? We started with simple Markov chains showing how statistical word prediction works, then dove into the core concepts of word embeddings, self-attention, and next word prediction. Now, it’s time for the grand finale: if you want to build your own working transformer language model in R, read on!

Can Money Really Buy Happiness? Or How to Lie with Statistics in Science

It’s a widely accepted notion that money influences happiness, a concept famously associated with Nobel laureate Daniel Kahneman, who purportedly demonstrated that emotional wellbeing increases with income but plateaus beyond an annual threshold of about $75,000.

This idea has permeated both academic circles and popular media, reinforcing the belief that there’s a direct correlation between financial prosperity and happiness. But how accurate is this belief when we scrutinize the data more closely? To find out, read on!

Artificial Intelligence in Academic Theses: An Opportunity, Not a Threat

In an era where artificial intelligence (AI) is increasingly permeating various aspects of our lives, the academic world also faces the challenge of dealing with this rapid technological development. This is particularly true of final theses and term papers, which raises the question of how we, as educational institutions, should handle the use of large language models (LLMs) such as ChatGPT and Google Gemini.

Reversion to the Mean: Unraveling a Pervasive Misconception in Business and Beyond

In the realm of business and leadership, one statistical phenomenon often goes unrecognized yet significantly influences our understanding of performance and success. This is the concept of reversion to the mean (also called regression to the mean). This seemingly simple statistical occurrence can profoundly impact how we perceive management strategies, leadership effectiveness, and even the fate of those gracing the covers of prominent magazines. To understand what is going on, read on!

The Forgotten Factor in the Middle East Conflict | Youth Bulge Theory

The geopolitics of the Middle East has always been complex, with its share of conflicts and political unrest. Numerous theories have been proposed to dissect the underpinnings of the region’s political instability.

One such theory, known as the Youth Bulge Theory, asserts that a high proportion of young people within a population can lead to political instability and even violence. This theory could provide an illuminating perspective on the dynamics of the Middle East conflicts.

If you want to understand this most important ingredient of the ongoing conflict, read on!

Confidence Intervals in Election Polling: Understanding the Uncertainty of Political Forecasting

Election polls play a crucial role in predicting the outcome of elections and shaping public opinion. However, it’s important to understand that the results of any single poll should be taken with a grain of salt.

Many polls survey only about 1,000 people on their political preferences, a sample that is quite small compared with the often millions of voters. So, how reliable are those results? Or in other words, how confident can we be in them? To understand some of these intricacies, read on!
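To give a rough feel for the numbers, here is a minimal sketch of the textbook normal-approximation (Wald) confidence interval for a proportion. It assumes simple random sampling and a population much larger than the sample. The 40% support figure and the helper `margin_of_error` are hypothetical and chosen only for illustration.

```python
import math


def margin_of_error(p_hat: float, n: int, z: float = 1.96) -> float:
    """Half-width of the normal-approximation CI for a proportion.

    z = 1.96 corresponds to a 95% confidence level.
    """
    return z * math.sqrt(p_hat * (1.0 - p_hat) / n)


# Hypothetical poll: 1,000 respondents, 40% of them support some party
n = 1000
p_hat = 0.40

moe = margin_of_error(p_hat, n)
lower, upper = p_hat - moe, p_hat + moe
print(f"95% CI: {lower:.3f} to {upper:.3f} (+/- {moe:.3f})")
```

Under these assumptions, the uncertainty is roughly three percentage points in either direction. The formula depends on the sample size `n` but not on the number of voters, which is why about 1,000 respondents can be informative even for an electorate of millions.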

Is this still Weather or is it already Climate? Decoding Chaos!

Weather and climate are words often used interchangeably in casual conversations. But when a chilly summer breeze sweeps through in July, or when unexpected rains dampen our winter holidays, the age-old debate resurfaces: “Is climate change even real?”

If you want to dive deep into the fascinating realm of chaotic systems to unravel this enigma and distinguish between weather and climate, read on!

Can a Simple Multi-Agent Model Replicate Complex Stock Market Behaviour?

The stock market is one of the most complex systems we know about. Millions of intelligent, highly competitive people (and increasingly AIs) try to outwit each other to earn as much money as possible.

In this post, we build a simulation in which simple agents employ different trading strategies on an artificial stock market to replicate key stylized facts of real financial markets, so read on!