How to prevent data leakage in pandas & scikit-learn ☔
What is data leakage, why is it problematic, and how can you prevent it when working on a supervised Machine Learning problem in Python?continue reading.
What is data leakage, why is it problematic, and how can you prevent it when working on a supervised Machine Learning problem in Python?continue reading.
Highlights to the most recent updates to `sparklyr` and friendscontinue reading.
Learn how to “discretize” or “bin” your continuous features using Python’s scikit-learn, and find out why I usually don’t recommend doing so.continue reading.
Interact with Github Copilot and OpenAI’s GPT (ChatGPT) models directly in RStudio. The `chattr` Shiny add-in makes it easy for you to interact with these and other Large Language Models...continue reading.
Welcome to the future of pharmaceuticals, where the fusion of science and technology opens new horizons in healthcare. Imagine a world where the journey from the laboratory to the patient’s...continue reading.
Dealing with the problem of overfitting is one of the core issues in machine learning and AI. Your model seems to work perfectly on the training set, but when you...continue reading.
If you want to build high-performing machine learning and AI systems, then simply training those systems is rarely enough. You often need to build multiple models, often with multiple different...continue reading.
tldr: I used GPT-4 Turbo, GPT-3.5 Turbo, and two open-source offline LLMs to create flashcards for a spaced repetition system (Anki) on a mathematical topic; I rated the 100 LLM-suggested...continue reading.
In machine learning, making sure that you have a model that performs well is, in some sense, the most important thing. This means that you need to be really good...continue reading.
In this Microsoft Fabric series: To wrap up the series, let’s check the material available online, for you to continue learning, exploring and enjoying Microsoft Fabric. The official website: https://www.microsoft.com/en-us/microsoft-fabricMicrosoft...continue reading.
Mmm. Overfitting. It’s the bane of most machine learning developers. You build a model that performs so well on the training data, and think “I’ve done such a good job!”...continue reading.
In this Microsoft Fabric series: OneLake comes automatically with every Microsoft Fabric tenant and represents a single, logical data lake. Its main features are its unification and one copy of...continue reading.
In this Microsoft Fabric series: Admin portal serves purpose for governing and setting the Microsoft Fabric, where you can make tenant settings, also access the Microsoft 365 admin portal, and...continue reading.
In this Microsoft Fabric series: Apps are collections of dashboards and reports in one easy-to-find place. Go to Apps and click on “Get Apps”. Click the Microsoft Fabric Capacity Metrics...continue reading.
In this Microsoft Fabric series: Monitoring workspaces, executions and checking logs is so quintessential, that one should get familiarized with this in the first place. Monitoring Hub The easy way...continue reading.
Welcome to our deep dive into one of the foundations of machine learning: Training, Validation, and Test Sets. In this blog post, I’ll explain the purpose of having these different...continue reading.
In this Microsoft Fabric series: Notebooks have been around for a long time and people, community, and professionals have proven the usability, practicality, versioning and reliability of notebooks. Not to...continue reading.
In this Microsoft Fabric series: In Fabric, you can create streaming semantic model and when selecting you will get the usual sources: Differences are explained here: Once we create a...continue reading.
In this Microsoft Fabric series: We have created a Power BI report directly from the datalake and today we will check how to do same with dashboard and paginated reports....continue reading.
If you want to master machine learning and AI, you’ll need to learn and master a variety of minor concepts that underpin these systems. One such concept is the classification...continue reading.