Vitess Latency Profiling: Vtexplain and Query Analysis (2026)

Vitess isn’t just about sharding; it’s a distributed SQL database that offers deep insights into query performance.

Let’s see Vitess in action. Imagine a typical application with a users table.

-- Schema
CREATE TABLE users (
    id BIGINT,
    name VARCHAR(100),
    email VARCHAR(100),
    PRIMARY KEY (id)
);

-- Sharding
-- This table is sharded by `id`.
-- A typical Vitess workflow involves a client sending a query to the VTGate.
-- VTGate consults the VSchema to determine which vindex to use for routing.
-- For `users` table, VTGate uses the `id` column's vindex.
-- If the query is `SELECT * FROM users WHERE id = 123`, VTGate determines the shard containing `id = 123`.
-- It then forwards the query to the appropriate VReplication stream or VTablet.
-- The VTablet executes the query against the underlying MySQL.
-- Results are sent back to VTGate, which aggregates and returns to the client.

The vtexplain command-line tool is your primary weapon for understanding query plans before they even hit the wire. It’s like EXPLAIN in MySQL, but with Vitess-specific intelligence.

vtexplain --keyspace=my_keyspace --vschema='{"sharded": true, "vindexes": {"my_vindex": {"type": "numeric", "column": "id"}}, "tables": {"users": {"column_vindexes": [{"column": "id", "name": "my_vindex"}]}}}' 'SELECT * FROM users WHERE id = 123'

This command will output a JSON structure detailing how Vitess plans to execute your query. You’ll see information like the target vindexes, the routing plan (which shards will be queried), and the estimated cost. This helps identify potential bottlenecks before deployment.

Beyond vtexplain, Vitess provides detailed query statistics through its VTablet and VTGate endpoints. The /debug/query_log endpoint on VTGate is invaluable. It shows you a real-time stream of queries being processed, their execution times, and importantly, the plan Vitess used.

# Example: Fetching query log from VTGate on port 15991
curl http://localhost:15991/debug/query_log

This output is dense but reveals critical metrics. Look for queries with unusually high execution_time or rows_affected. The plan field within each log entry shows the vindex and keyspace_ids that Vitess used for routing. If a query is consistently hitting many shards when it should only hit one, or if it’s performing a full table scan on a sharded table, you’ve found a problem.

The vindex is the core of Vitess’s routing. It maps your query’s WHERE clause columns to keyspace_ids, which then determine the shard. If your vindex is poorly chosen or your query isn’t using the indexed column effectively, Vitess might resort to broadcasting the query to all shards, leading to massive latency.

For example, if you have a users table sharded by id using a numeric vindex, a query like SELECT * FROM users WHERE email = 'test@example.com' will be slow if email is not part of a vindex. Vitess doesn’t know which shard to send this to directly. It will likely broadcast the query. vtexplain would show this as a "scatter" operation.

# Example of a poorly routed query plan (conceptual output)
{
  "query": "SELECT * FROM users WHERE email = 'test@example.com'",
  "plan": {
    "Type": "scatter", // Indicates query will be sent to all shards
    "Vindex": null,
    "KeyspaceIDs": null,
    "Query": "SELECT * FROM users WHERE email = 'test@example.com' /* vitess:scatter */"
  }
}

The fix is usually to create a secondary vindex on the email column or to ensure your application always queries by the primary vindex column (id).

A common pitfall is overlooking the VSchema configuration. The VSchema is the blueprint for how Vitess understands your keyspaces, tables, and vindexes. An incorrect VSchema can lead Vitess to misinterpret your sharding strategy, causing inefficient routing or outright query failures. Always validate your VSchema with vschema --keyspace=<your_keyspace> --json.

When troubleshooting latency, pay close attention to the num_shards field in the vtexplain output for scatter queries. If this number is larger than expected, it means your query is being sent to more shards than necessary. This often points to a missing or incorrectly configured vindex. The solution is to define a vindex for the queried column or ensure your application uses the correct column in its WHERE clause.

The next hurdle after optimizing query routing is understanding and managing distributed transactions.

More Deep Dives in Vitess