Using pixtral models with crew as a custom tool

Does anyone get this working by chance? I’ve been trying but keep running into issues in my pipeline.

I’m trying to create a mapping vision based tool that can understand and read maps.