questions.parquet1.9 mb
Filtering questions that are not able to be answered by the context.
1 year ago documents.parquet2.4 mb
fix parsing of image tags in markdown docs
1 year ago categorizations.parquet1.8 mb
update generate_questions script to take in new fields
1 year ago chunks.parquet2.3 mb
update generate_questions script to take in new fields
1 year ago arxiv_questions.parquet1.1 mb
extract images from pdfs and consolidate title and abstract extraction
1 year ago arxiv_markdown_chunks_are_interesting.parquet8.3 mb
add filter for if the chunks are interesting
1 year ago arxiv_markdown.parquet2.1 mb
Updated "title_embedding" with text-embedding-3-small
1 year ago arxiv_markdown_chunks.parquet8.6 mb
Computing embeddings for paper sections
1 year ago