Finetuning Toolshim Models for Tool Calling

· 7 min read
Alice Hau
Machine Learning Engineer
Michael Neale
Principal Engineer

Our recently published Goose benchmark revealed significant performance limitations in models where tool calling is not straightforwardly supported (e.g., Gemma3, Deepseek-r1, phi4). These models often fail to invoke tools at appropriate times, or they produce malformed or inconsistently formatted tool calls. With the most recent releases of Llama4 and Deepseek v3 (0324), we are again seeing weak tool calling performance, even on these flagship open-weight models.
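
To make "malformed tool call" concrete, here is a minimal sketch of the kind of check a client could run on raw model output. Everything in it is an assumption made for illustration: the JSON shape with `name` and `arguments` keys, the helper name `parse_tool_call`, and the `shell` tool are hypothetical and are not taken from Goose or from any particular model's format.

```python
import json


def parse_tool_call(raw, known_tools):
    """Return (name, arguments) if raw is a well-formed tool call, else None.

    Assumed (hypothetical) format: a JSON object with a string "name"
    that matches a known tool and an object "arguments".
    """
    try:
        payload = json.loads(raw)
    except json.JSONDecodeError:
        return None  # not valid JSON at all
    if not isinstance(payload, dict):
        return None  # valid JSON, but not an object
    name = payload.get("name")
    args = payload.get("arguments")
    if not isinstance(name, str) or name not in known_tools:
        return None  # missing or unknown tool name
    if not isinstance(args, dict):
        return None  # arguments must be a JSON object
    return name, args


if __name__ == "__main__":
    tools = {"shell"}
    print(parse_tool_call('{"name": "shell", "arguments": {"command": "ls"}}', tools))
    print(parse_tool_call('shell(ls)', tools))
```

A check like this only catches formatting problems. It says nothing about whether the model chose to call a tool at the right moment, which is the other failure mode described above.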

A Visual Guide To MCP Ecosystem

· 4 min read
Ebony Louis
Developer Advocate

You ever open a GitHub repo or blog post, read the first sentence, and immediately feel like you’ve stumbled into a PhD dissertation?

Yeah. Same.

MCP (Model Context Protocol) sounds complicated, but it’s really not. Think of this as your go-to cheat sheet: no whitepapers, no academic jargon, just plain English and a few good visuals.

How to Vibe Code Responsibly (with Goose)

· 7 min read
Rizel Scarlett
Staff Developer Advocate

On Feb 2, 2025, Andrej Karpathy coined the phrase "vibe coding". Vibe coding represents a new approach to coding where developers ask an AI agent to build something, and they go with the flow.

The Model Context Protocol (MCP) makes this practice possible. Before MCP, developers copied and pasted context between applications. This workflow fell short of the AI agent automation that everyone had promised. Today, AI agents can work autonomously using MCP and integrate with any application, from GitHub to Cloudflare, YouTube, and Figma.

This shift democratizes coding. For example, it's empowered:

  • Web developers to create video games with Unity
  • Designers and product managers to prototype full-stack applications
  • Business owners to transform their visions into functional products

It's a freeing experience. But too often, we're Icarus with the keyboard, vibe coding too close to the sun.

Top 5 MCP Servers I Use as a Developer with Goose

· 6 min read
Adewale Abati
Staff Developer Advocate

As a developer, finding the right tools that seamlessly work together can feel like discovering a superpower. And when you have a working process, it can sometimes be difficult to try out new tools.

With the introduction of MCPs, AI agents like Goose are able to plug into my existing tools, and the only thing that changes in my workflow is the much-welcomed automation that comes with it. I still do the same things, but backed by AI, I can now do them faster and with more confidence.

Today, I'm excited to share not just my favorite MCP servers, but the ones I actually use almost every day, with real applications that you can probably relate to as well.

Community-Inspired Benchmarking: The Goose Vibe Check

· 21 min read
Alice Hau
Machine Learning Engineer

We've been measuring Goose's performance with various AI models, including a variety of popular open-source models that can run locally on consumer hardware (RTX 4080, Mac M-series). We understand that many in our community value a fully open-source, local experience without relying on cloud services.

This blog shares our findings comparing open-source models against their closed counterparts, highlighting both current performance gaps and paths for future improvement. Our benchmark is still in its early stages, but we wanted to release it as a starting point for distinguishing models that exhibit stronger agentic capabilities by their ability to pilot Goose (distinct from reasoning or other capabilities often captured in other popular benchmarks).

Securing the Model Context Protocol

· 11 min read
Alex Rosenzweig
Staff Security Engineer

Authors: Alex Rosenzweig, Arihant Virulkar, Andrea Leoszko, Wes Ring, Mike Shema, F G, Alex Klyubin, Michael Rand, Zhen Lian, Angie Jones, Douwe Osinga, Mic Neale, Bradley Axen, Gelareh Taban

At Block, we’ve been working hard to augment the capabilities of AI tooling by building "MCP Servers", which are designed to make our Artificial Intelligence (AI) agent, codename goose, more capable of interacting with the systems and tools we care about.

Block’s Information Security (InfoSec) team has been heavily involved in this work, and we wanted to capture our learnings in the space to help others. We expect adoption and use cases to keep growing, including applications of the technology in the security domain.

Vibe Coding with Goose and the Speech MCP

· 4 min read
Adewale Abati
Staff Developer Advocate

Imagine creating an app just by describing what you want out loud, like you’re talking to a friend. That’s the magic of vibe coding: turning natural language into working code with the help of an AI agent. And while typing a prompt gets the job done, saying it out loud hits different 🔥 The new Speech MCP server has quite literally entered the chat.

Codename Goose Goes to Boston

· 6 min read
Rizel Scarlett
Staff Developer Advocate

Question: What happens when you bring 70+ AI enthusiasts, open source contributors, and curious learners together in one room?

Answer: You get an electric night filled with great conversations, hands-on hacking, and mind-blowing insights into agentic systems.

This week, we hosted our very first Goose Meetup in Boston at the Cambridge Innovation Center. The turnout and energy exceeded all expectations! From first-time Goose users to seasoned AI engineers, attendees gathered to explore how Goose and the Model Context Protocol (MCP) are shaping the future of AI automation.