100_questions648.5 kB
Updated "answer_relevance_question_embeddings" with text-embedding-3-small
1 year ago ragas8.6 mb
add section embeddings to ragas
1 year ago collected_data.jsonl81.1 kB
Merge commit 196e12f90ed312f456dd409f58a47a49 into 1693168f05dc8a6fb13215ec5515685f
1 year ago questions.parquet649.6 kB
adding questions to data/ragas
1 year ago questions_answered.parquet1 mb
Answering the questions with GPT-4o
1 year ago answerable_questions.parquet650 kB
remove oxen columns from committed data/answerable_questions.parquet
1 year ago documents.parquet2.4 mb
fix parsing of image tags in markdown docs
1 year ago categorizations.parquet1.8 mb
update generate_questions script to take in new fields
1 year ago chunks.parquet2.3 mb
update generate_questions script to take in new fields
1 year ago arxiv_questions.parquet1.1 mb
extract images from pdfs and consolidate title and abstract extraction
1 year ago arxiv_markdown_chunks_are_interesting.parquet8.3 mb
add filter for if the chunks are interesting
1 year ago arxiv_markdown.parquet2.1 mb
Updated "title_embedding" with text-embedding-3-small
1 year ago arxiv_markdown_chunks.parquet8.6 mb
Computing embeddings for paper sections
1 year ago