Oxen.ai Blog

Welcome to the Oxen.ai blog 🐂

The team at Oxen.ai is dedicated to helping AI practictioners go from research to production. To help enable this, we host a research paper club on Fridays called ArXiv Dives, where we go over state of the art research and how you can apply it to your own work.

Take a look at our Arxiv Dives, Practical ML Dives as well as a treasure trove of content on how to go from raw datasets to production ready AI/ML systems. We cover everything from prompt engineering, fine-tuning, computer vision, natural language understanding, generative ai, data engineering, to best practices when versioning your data. So, dive in and explore – we're excited to share our journey and learnings with you 🚀

Arxiv Dives - Retrieval Augmented Generation (RAG)
Arxiv Dives - Retrieval Augmented Generation (RAG)

Every Friday at Oxen.ai we host a public paper club called "Arxiv Dives" to make us smarter Oxen 🐂 🧠. These are the notes from the group session for reference. If you would like ...

Greg Schoeninger
Greg Schoeninger
Sep 25, 2023
- Arxiv Dives
9 min read
Arxiv Dives - Training Language Models to Follow Instructions (InstructGPT)
Arxiv Dives - Training Language Models to Follow Instructions (InstructGPT)

Join the "Nerd Herd" Every Friday at Oxen.ai we host a public paper club called "Arxiv Dives" to make us smarter Oxen 🐂 🧠. These are the notes from the group session for referen...

Greg Schoeninger
Greg Schoeninger
Sep 15, 2023
- Arxiv Dives
8 min read
Arxiv Dives - Language Models are Unsupervised Multitask Learners (GPT-2)
Arxiv Dives - Language Models are Unsupervised Multitask Learners (GPT-2)

Every Friday at Oxen.ai we host a public paper club called "Arxiv Dives" to make us smarter Oxen 🐂 🧠. These are the notes from the group session for reference. If you would like ...

Greg Schoeninger
Greg Schoeninger
Sep 8, 2023
- Arxiv Dives
8 min read
Creating a Cute Custom Character with Stable Diffusion and Dreambooth
Creating a Cute Custom Character with Stable Diffusion and Dreambooth

Introduction Stable Diffusion is an incredible open-source tool for fast, effective generation of novel images across a wide variety of domains. Despite its power and convenience...

Ben Artuso
Ben Artuso
Aug 1, 2023
8 min read
Collecting Data from Human Feedback for Generative AI
Collecting Data from Human Feedback for Generative AI

Introduction Human feedback is essential to the accuracy and continual improvement of generative AI systems. Incorporating human ranking of alternative model outputs into model r...

Ben Artuso
Ben Artuso
Jul 26, 2023
6 min read
Building ML datasets from email with Oxen.ai 🐂 📧
Building ML datasets from email with Oxen.ai 🐂 📧

Making dataset management easier for all stakeholders In a previous post, we showed how Oxen's Remote Workspaces radically simplify the process of contributing to shared datasets....

Ben Artuso
Ben Artuso
Jul 18, 2023
7 min read
Machine Inference != Machine Learning
Machine Inference != Machine Learning

One of the reasons I love the AI community is the openness to share research and build on top of each other’s work. It is great to see the community continue to publish state of th...

Greg Schoeninger
Greg Schoeninger
Jul 11, 2023
5 min read
We version our code, why not our data?
We version our code, why not our data?

All machine learning solutions start with a good dataset. The author of “Deep Learning with Python” goes as far as stating Spending more effort and money on data collection almost...

Greg Schoeninger
Greg Schoeninger
Jul 11, 2023
5 min read
🐂 Contribute to Massive Datasets in Seconds with Oxen.ai’s Remote Workspaces
🐂 Contribute to Massive Datasets in Seconds with Oxen.ai’s Remote Workspaces

The datasets used for training and benchmarking machine learning models are incredibly large and continue to grow rapidly. ImageNet, a foundational dataset for visual object recogn...

Ben Artuso
Ben Artuso
Jul 11, 2023
5 min read
Generative Deep Learning Book - Chapter 5 - Autoregressive Models
Generative Deep Learning Book - Chapter 5 - Autoregressive Models

Join the Oxen.ai "Nerd Herd" Every Friday at Oxen.ai we host a public paper club called "Arxiv Dives" to make us smarter Oxen 🐂 🧠. These are the notes from the group session for...

Greg Schoeninger
Greg Schoeninger
Jun 23, 2023
- Arxiv Dives
10 min read