categorizations.parquet1.8 mb
update generate_questions script to take in new fields
1 year ago chunks.parquet2.3 mb
update generate_questions script to take in new fields
1 year ago documents.parquet2.4 mb
update generate_questions script to take in new fields
1 year ago questions.parquet153.2 kB
update generate_questions script to take in new fields
1 year ago arxiv_questions.parquet1.1 mb
extract images from pdfs and consolidate title and abstract extraction
1 year ago arxiv_markdown_chunks_are_interesting.parquet8.3 mb
add filter for if the chunks are interesting
1 year ago arxiv_markdown.parquet2.1 mb
Updated "title_embedding" with text-embedding-3-small
1 year ago arxiv_markdown_chunks.parquet8.6 mb
Computing embeddings for paper sections
1 year ago