Oxen.ai Blog

Welcome to the Oxen.ai blog 🐂

The team at Oxen.ai is dedicated to helping AI practitioners go from research to production. To support this, we host a research paper club on Fridays called ArXiv Dives, where we go over state-of-the-art research and how you can apply it to your own work.

Take a look at our Arxiv Dives and Practical ML Dives, as well as a treasure trove of content on how to go from raw datasets to production-ready AI/ML systems. We cover everything from prompt engineering, fine-tuning, computer vision, natural language understanding, generative AI, and data engineering to best practices for versioning your data. So dive in and explore. We're excited to share our journey and learnings with you 🚀

We version our code, why not our data?

All machine learning solutions start with a good dataset. The author of “Deep Learning with Python” goes as far as stating Spending more effort and money on data collection almost...

Greg Schoeninger
Jul 11, 2023
5 min read
🐂 Contribute to Massive Datasets in Seconds with Oxen.ai’s Remote Workspaces

The datasets used for training and benchmarking machine learning models are incredibly large and continue to grow rapidly. ImageNet, a foundational dataset for visual object recogn...

Ben Artuso
Jul 11, 2023
5 min read
Generative Deep Learning Book - Chapter 5 - Autoregressive Models

Join the Oxen.ai "Nerd Herd". Every Friday at Oxen.ai, we host a public paper club called "Arxiv Dives" to make us smarter Oxen 🐂 🧠. These are the notes from the group session for...

Greg Schoeninger
Jun 23, 2023
- Arxiv Dives
10 min read
Generative Deep Learning Book - Chapter 4 - Generative Adversarial Networks (GANs)

Join the Oxen.ai "Nerd Herd". Every Friday at Oxen.ai, we host a public paper club called "Arxiv Dives" to make us smarter Oxen 🐂 🧠. These are the notes from the group session for...

Greg Schoeninger
Jun 15, 2023
- Arxiv Dives
7 min read
Generative Deep Learning Book - Chapter 3 - Variational Auto Encoders

Join the Oxen.ai "Nerd Herd". Every Friday at Oxen.ai, we host a public paper club called "Arxiv Dives" to make us smarter Oxen 🐂 🧠. These are the notes from the group session for...

Greg Schoeninger
Jun 10, 2023
- Arxiv Dives
8 min read
Generative Deep Learning Book - Chapters 1 & 2 - Intro

Join the Oxen.ai "Nerd Herd". Every Friday at Oxen.ai, we host a public paper club called "Arxiv Dives" to make us smarter Oxen 🐂 🧠. These are the notes from the group session for...

Greg Schoeninger
Jun 4, 2023
- Arxiv Dives
9 min read
Generative Deep Learning Book - Preface

Join the Oxen.ai "Nerd Herd". Every Friday at Oxen.ai, we host a public paper club called "Arxiv Dives" to make us smarter Oxen 🐂 🧠. These are the notes from the group session for...

Greg Schoeninger
Jun 3, 2023
- Arxiv Dives
4 min read
Command Line Tool to Inspect Parquet, CSV, and other DataFrames 🐂 🌾

When it comes to data, most data scientists know how to use pandas for exploratory data analysis. Spin up a Python environment or Jupyter notebook and start loading your data. Thes...

Greg Schoeninger
Apr 20, 2023
3 min read
Blazing Fast Data Version Control with Oxen 🐂 🔥

As data scientists or machine learning engineers, data is the key to what makes our products succeed. We are constantly running experiments, and the world that is constantly changin...

Greg Schoeninger
Apr 20, 2023
2 min read