The ADL tested 17 open-source models, including Google’s Gemma-3, Microsoft’s Phi-4, and Meta’s Llama 3, using prompts ...