A herd of enthusiastic oxen running towards the future
Oxen.ai Blog

Welcome to the Oxen.ai blog 🐂

The team at Oxen.ai is dedicated to helping AI practictioners go from research to production. To help enable this, we host a research paper club on Fridays called ArXiv Dives, where we go over state of the art research and how you can apply it to your own work.

Take a look at our Arxiv Dives, Practical ML Dives as well as a treasure trove of content on how to go from raw datasets to production ready AI/ML systems. We cover everything from prompt engineering, fine-tuning, computer vision, natural language understanding, generative ai, data engineering, to best practices when versioning your data. So, dive in and explore – we're excited to share our journey and learnings with you 🚀

Recent
Mamba: Linear-Time Sequence Modeling with Selective State Spaces - Arxiv Dives
Dec 15, 2023

What is Mamba 🐍? Mamba at it's core is a recurrent neural network architecture, that outperforms Transformers with faster inference and improved handling of long sequences of length up to 1 million. This post dives into how it works and will give y...

Arxiv Dives
Practical ML Dive - How to customize a Vision Transformer on your own data
Dec 14, 2023

Welcome to Practical ML Dives, a series spin off of Arxiv Dives. In Arxiv Dives, we cover state of the art research papers, and dive into the gnitty gritty details of how AI models work. From the math to the data to the model architecture, we cover ...

Arxiv Dives
Arxiv Dives - Zero-shot Image Classification with CLIP
Dec 8, 2023

CLIP explores the efficacy of learning image representations from scratch with 400 million image-text pairs, showcasing zero-shot transfer capabilities across diverse computer vision tasks. This post dives into how it works and will give you an intui...

Arxiv Dives
How NOT to store unstructured machine learning datasets
Dec 8, 2023

Training data is typically the most valuable part of any machine learning project. As we converge on model architectures like the transformer that perform well on many tasks, it is the data that goes into the model that makes the difference. Data i...

🧼 SUDS - A Guide to Structuring Unstructured Data
Dec 8, 2023

At Oxen.ai we value high quality datasets. We have many years of experience training and evaluating models, and have seen many interesting data formats. Interesting is something we should optimize for when it comes to content of a dataset, not the fo...

Arxiv Dives - Vision Transformers (ViT)
Dec 1, 2023

With all of the hype around Transformers for natural language processing and text, the authors of this paper beg the question - can we apply self-attention and Transformers to images as well? This post dives into how it works and will give you an int...

Arxiv Dives
Reading List For Andrej Karpathy’s “Intro to Large Language Models” Video
Nov 27, 2023

Andrej Karpathy recently released an hour long talk on “The busy person’s intro to large language models” that had some great tidbits whether you are an expert in machine learning or just getting starting in AI. There are a lot of resources, papers,...

Arxiv Dives - A Mathematical Framework for Transformer Circuits - Part 2
Nov 21, 2023

Every Friday at Oxen.ai we host a paper club called "Arxiv Dives" to make us smarter Oxen 🐂 🧠. We believe diving into the details of research papers is the best way to build fundamental knowledge and keep up with the bleeding edge. Our goal is to ...

Arxiv Dives
Arxiv Dives - A Mathematical Framework for Transformer Circuits - Part 1
Nov 11, 2023

Every Friday at Oxen.ai we host a paper club called "Arxiv Dives" to make us smarter Oxen 🐂 🧠. We believe diving into the details of research papers is the best way to build fundamental knowledge and keep up with the bleeding edge. If you would li...

Arxiv Dives
Data Version Control 101 with Oxen
Nov 9, 2023

This intro tutorial from Oxen.ai shows how Oxen can make versioning your data as easy as versioning your code. Oxen is built to track and store changes for everything from a single CSV to data repositories with millions of unstructured images, video...