Running a multi-model stack is popular for capturing diverse strengths and...
https://anotepad.com/notes/gbq7ssq6
Running a multi-model stack is popular for capturing diverse strengths and identifying failure modes through comparison. However, proceed with caution. Blindly synthesizing outputs can obscure important model dissent and create a false sense of accuracy