lilian
LilianZhou
User account
LilianZhou's Repositories
Displaying Page 1 of 2 (13 total Repositories)
Public
0
102.4 MB
27
Public
0
40.9 kB
22
Public
0
38.1 MB
21
Public
0
234.5 MB
2210K
Public
0
3.8 MB
32
Public
0
3.3 MB
3620
Public
6
This is the dataset for pretraining the Large Language and Vision Assistant (LLaVA), an end-to-end trained large multimodal model that connects a vision encoder and an LLM for general-purpose visual and language understanding.
6.6 GB
12595K