Recent Posts
Decoding Transformers: The Neural Nets Behind LLMs and More
When Karpathy was asked by Lex Fridman “What is the most beautiful or surprising idea in deep learning or AI”, ...
Read More
Read More
Deep Learning Gymnastic Series
Deep Learning Gymnastics #1: Tensor Broadcasting
In the heart of the implementation of modern deep learning models (yes, including LLMs) always lies some subtle and critical ...
Read More
Read More
Deep Learning Gymnastics #2: Tensor Indexing
Welcome to the second episode of the Deep Learning Gymnastics series. Hope you're in good shape. Get warmed up. We ...
Read More
Read More
Deep Learning Gymnastics #3: Tensor (re)Shaping
Welcome to the 3rd episode of the Deep Learning Gymnastics series. By now you should already start to be in ...
Read More
Read More
Deep Learning Gymnastics #4: Master Your (LLM) Cross Entropy
Welcome to the 4th episode of our Deep Learning Gymnastics series. Today, we’ll use all the skills learned in our ...
Read More
Read More
Deep Dive in Logistic Regression Series
Deep Dive Into Logistic Regression: Part 1
Learn the fundamental theory behind logistic regression ...
Read More
Read More
Deep Dive Into Logistic Regression: Part 2
Want to know how to implement Stochastic Gradient Descent for Logistic regression able to learn millions of parameters using the hashing ...
Read More
Read More
Deep Dive Into Logistic Regression: Part 3
In this third and last post of this series, we present the use of a very effective and powerful library ...
Read More
Read More
Other/Older Posts
Decoding Transformers: The Neural Nets Behind LLMs and More
When Karpathy was asked by Lex Fridman “What is the most beautiful or surprising idea in deep learning or AI”, ...
Read More
Read More
Deep Learning Gymnastics #4: Master Your (LLM) Cross Entropy
Welcome to the 4th episode of our Deep Learning Gymnastics series. Today, we’ll use all the skills learned in our ...
Read More
Read More
Deep Learning Gymnastics #3: Tensor (re)Shaping
Welcome to the 3rd episode of the Deep Learning Gymnastics series. By now you should already start to be in ...
Read More
Read More
Visualising SGD with Momentum, Adam and Learning Rate Annealing
[Full code on my github here . To see it from mobile, once you land on github, click on "Desktop Version" ...
Read More
Read More
A Data Science Exploration From the Titanic in R
Kaggle offered this year a knowledge competition called "Titanic: Machine Learning from Disaster" exposing a popular "toy-yet-interesting" data set around the ...
Read More
Read More
How To Easily Build And Observe TF-IDF Weight Vectors With Lucene And Mahout
You have a collection of text documents, and you want to build their TF-IDF weight vectors, probably before doing some clustering ...
Read More
Read More
What Are The 10 Most Cited Websites On Twitter When Tweeting About Hot Trends?
Lately I wrote a post on how to build a relevant real time search engine prototype in few hundreds lines ...
Read More
Read More
Hadoop Tutorial Series, Issue #4: To Use Or Not To Use A Combiner
Welcome to the fourth issue of the Hadoop Tutorial Series. Combiners are another important Hadoop's feature that every hadoop developer ...
Read More
Read More
Hadoop Tutorial Series, Issue #3: Counters In Action
Note: This post has been updated with a code working for hadoop 0.20.1. In this 3rd issue of the hadoop ...
Read More
Read More
How To Build A Relevant Real Time Search Engine Prototype In Few Hundreds Lines Of Code
By the end of the post you'll find the code along with a small command line JAVA program to play ...
Read More
Read More
Hadoop Tutorial Series, Issue #2: Getting Started With (Customized) Partitioning
In the Issue #1 of this series, we set up the "learning playground" (based on the Cloudera Virtual Machine) in ...
Read More
Read More
Hadoop Tutorial Series, Issue #1: Setting Up Your MapReduce Learning Playground
Update: Instructions updated for hadoop 0.20.2. This is the first post of a series of small hadoop tutorials introducing progressively ...
Read More
Read More
Flexible Collaborative Filtering In JAVA With Mahout Taste
I recently had to build quickly a prototype of recommendation engine for a promising start-up company. I wanted to first ...
Read More
Read More
Writing A Token N-Grams Analyzer In Few Lines Of Code Using Lucene
If you need to parse the tokens n-grams of a string, you may use the facilities offered by lucene analyzers ...
Read More
Read More
Drawing A Zipf Law Using Gnuplot, Java and Moby-Dick
There are many tools out there to build more or less quickly any kind of graphs. Depending on your needs ...
Read More
Read More
Flexible Java Profiling And Monitoring Using The Netbeans Profiler
I have tested a lot of those open source profiler. My preference goes definitely to the integrated Netbeans profiler. It ...
Read More
Read More
BeanShell Tutorial: Quick Start On Invoking Your Own Or External Java Code From The Shell
BeanShell is a lightweight scripting language that’s compatible with the Java language. It provides a dynamic environment for executing Java ...
Read More
Read More
5 Video Tutorials Of Small To Killer Eclipse Shortcuts
I believe that when you spend a significant percentage of your time on a specific software, it is an obligation ...
Read More
Read More
Google Hot Trends Clustering: The 100 Hottest Queries Tell You About 67.76 Stories In Average
Did you noticed that among the 100 (hourly updated) Google Hot Trends, there are always several hot queries that are ...
Read More
Read More
Open Calais From Java: Get Ready To Extract Entities, Facts And Events In 4 Minutes!
I'm a big fan of Open Calais, the well known web service that allows you to perform Named Entity, Facts ...
Read More
Read More
The Trick To Write A Fast (Universal) Java URL Expander
140 characters. Means something to you? This is about how twitter (and micro-blogging) was born. Even if some profane firefox ...
Read More
Read More
Can You Guess What Is The Hottest Trend Of Google Hot Trends ?
Either if you are working in SEO, or if you are a "trends hacker", or if you love like me ...
Read More
Read More