arxiv_questions.parquet1.1 mb
extract images from pdfs and consolidate title and abstract extraction
1 year ago documents.parquet2.4 mb
extract images from pdfs and consolidate title and abstract extraction
1 year ago arxiv_markdown_chunks_are_interesting.parquet8.3 mb
add filter for if the chunks are interesting
1 year ago arxiv_markdown.parquet2.1 mb
Updated "title_embedding" with text-embedding-3-small
1 year ago arxiv_markdown_chunks.parquet8.6 mb
Computing embeddings for paper sections
1 year ago