Eva02 weight_init #2565

Answered by rwightman
TonyCongqianWang asked this question in Q&A
Aug 10, 2025 · 2 comments · 2 replies

@TonyCongqianWang I could add support for the other init approach, but the main reason it's not there is that it only matters when training from scratch. I'm pretty sure the observation in the EVA02 paper was made for training from scratch with their combo of MIM + CLIP-style pretraining on very large datasets, and it may or may not carry over to other types of pretraining. All in all, given that most people are fine-tuning the EVA models from EVA weights, it's lower priority to add (and, more time-consuming, to test) the alternative init mode.

Answer selected by TonyCongqianWang