Artificial Intelligence in Academic Theses: An Opportunity, Not a Threat


In an era where artificial intelligence (AI) is increasingly permeating various aspects of our lives, the academic world is also faced with the challenge of dealing with this rapid technological development. This is particularly true regarding final theses and term papers, raising the question of how we, as educational institutions, should handle the use of foundation models like ChatGPT, Google Gemini, and other language-based models (LLMs).
Continue reading “Artificial Intelligence in Academic Theses: An Opportunity, Not a Threat”

Attention! What lies at the Core of ChatGPT? (Also as a Video!)


Word embedding, self-attention, and next-word prediction lie at the core of LLMs like ChatGPT. If you are curious about how these techniques work and want to see a simple example in R, read on!
Continue reading “Attention! What lies at the Core of ChatGPT? (Also as a Video!)”

ChatGPT can Create Datasets, Program in R… and when it makes an Error it can Fix that too!


ChatGPT from OpenAI leaves me speechless over and over again. I have been in the AI industry for many decades now and it has been a long time since I last had this feeling of utter fascination mixed with disbelief mixed with anxiety.

This is only a quick post in the context of R programming which I wanted to share with you, so read on!
Continue reading “ChatGPT can Create Datasets, Program in R… and when it makes an Error it can Fix that too!”

Learning Data Science: Predictive Maintenance with Decision Trees


Predictive Maintenance is one of the big revolutions happening across all major industries right now. Instead of changing parts regularly or even only after they failed it uses Machine Learning methods to predict when a part is going to fail.

If you want to get an introduction to this fascinating developing area, read on!
Continue reading “Learning Data Science: Predictive Maintenance with Decision Trees”

Please Subscribe to My New (German) Data Science YouTube Channel!


I am in the middle of creating a new German YouTube channel that is centered around data science and R! I put a lot of effort into it to serve the interests of the community.

If you want to be a part of the process, watch interesting videos with data-based analyses and look behind the scenes, please consider subscribing to the channel!

The number of subscriptions is also vital for the YouTube algorithm to recommend the videos to other viewers on the platform!
Continue reading “Please Subscribe to My New (German) Data Science YouTube Channel!”

Learning Path for “Data Science with R” – Part I


Over the course of the last two and a half years, I have written over one hundred posts for my blog “Learning Machines” on the topics of data science, i.e. statistics, artificial intelligence, machine learning, and deep learning.

I use many of those in my university classes and in this post, I will give you the first part of a learning path for the knowledge that has accumulated on this blog over the years to become a well-rounded data scientist, so read on!
Continue reading “Learning Path for “Data Science with R” – Part I”

Will I get my Money back? Credit Scoring with OneR


More and more decisions by banks on who gets a loan are being made by artificial intelligence. The terms being used are credit scoring and credit decisioning.

They base their decisions on models whether the customer will pay back the loan or will default, i.e. determine their creditworthiness. If you want to learn how to build such a model in R yourself (with the latest R ≥ 4.1.0 syntax as a bonus), read on!
Continue reading “Will I get my Money back? Credit Scoring with OneR”

The Most Advanced AI in the World explains what AI, Machine Learning, and Deep Learning are!


This is our 101’st blog post here on Learning Machines and we have prepared something very special for you!

Oftentimes the different concepts of data science, namely artificial intelligence (AI), machine learning (ML), and deep learning (DL) are confused… so we asked the most advanced AI in the world, OpenAI GPT-3, to write a guest post for us to provide some clarification on their definitions and how they are related.

We are most delighted to present this very impressive (and only slightly redacted) essay to you – enjoy!
Continue reading “The Most Advanced AI in the World explains what AI, Machine Learning, and Deep Learning are!”

Recidivism: Identifying the Most Important Predictors for Re-offending with OneR


In 2018 the renowned scientific journal science broke a story that researchers had re-engineered the commercial criminal risk assessment software COMPAS with a simple logistic regression (Science: The accuracy, fairness, and limits of predicting recidivism).

According to this article, COMPAS uses 137 features, the authors just used two. In this post, I will up the ante by showing you how to achieve similar results using just one simple rule based on only one feature which is found automatically in no-time by the OneR package, so read on!
Continue reading “Recidivism: Identifying the Most Important Predictors for Re-offending with OneR”

Cupid’s Arrow: How to Boost your Chances at Dating!


During our little break, Valentine’s Day was celebrated. Yet for many, it was a depressing day because they are single and are looking for love.

Speed dating is a popular format (in times of Covid-19 also in virtual form) to meet many different potential soul mates in a short period of time. If you want to learn which factors determine “getting to the next round”, read on!
Continue reading “Cupid’s Arrow: How to Boost your Chances at Dating!”