PluginBench
Skill
Pass
Audit score 90

sql-optimization-patterns

wshobson/agents

Master SQL query optimization, indexing strategies, and EXPLAIN analysis to eliminate slow queries.

What is sql-optimization-patterns?

This skill teaches systematic SQL optimization through query execution plan analysis, strategic indexing, and performance monitoring. Use it when debugging slow queries, designing schemas, or optimizing application performance.

  • Analyze query execution plans using EXPLAIN and EXPLAIN ANALYZE to identify bottlenecks
  • Design and implement effective indexes (B-Tree, Hash, GIN, GiST, BRIN) for different query patterns
  • Optimize JOIN operations, WHERE clauses, and SELECT statements to reduce query cost
  • Monitor database performance using pg_stat_statements and identify missing or unused indexes
  • Apply best practices for index maintenance including VACUUM, ANALYZE, and REINDEX operations
  • Recognize and avoid common pitfalls like over-indexing, implicit type conversion, and function-based WHERE clauses

How to install sql-optimization-patterns

npx skills add https://github.com/wshobson/agents --skill sql-optimization-patterns
Claude Code
Cursor
Windsurf
Cline

How to use sql-optimization-patterns

  1. 1.Run EXPLAIN ANALYZE on your slow query to examine the execution plan and identify costly operations
  2. 2.Check for sequential scans on large tables and determine if an index would help
  3. 3.Create appropriate indexes based on your WHERE clauses, JOIN conditions, and data types (B-Tree for equality/range, GIN for full-text/JSONB)
  4. 4.Verify index usage by re-running EXPLAIN ANALYZE after index creation
  5. 5.Monitor query performance using pg_stat_statements to track improvements and find new bottlenecks
  6. 6.Regularly run ANALYZE to update table statistics and VACUUM to maintain performance

Use cases

Good for
  • Debugging a slow-running query by analyzing its EXPLAIN plan and identifying sequential scans that should use indexes
  • Designing a composite index on (user_id, status) to optimize filtered order lookups in a high-traffic application
  • Creating a partial index on active users to reduce index size and improve write performance
  • Implementing full-text search with GIN indexes for efficient document searching
  • Identifying and removing unused indexes that are slowing down INSERT/UPDATE/DELETE operations
Who it's for
  • Database administrators optimizing production systems
  • Backend engineers improving application response times
  • Data engineers designing performant schemas
  • DevOps engineers reducing database costs and load
  • Full-stack developers debugging N+1 query problems

sql-optimization-patterns FAQ

What's the difference between EXPLAIN and EXPLAIN ANALYZE?

EXPLAIN shows the estimated query plan without executing the query. EXPLAIN ANALYZE actually runs the query and shows real execution statistics including actual time and rows processed, making it more accurate for identifying true bottlenecks.

When should I use a composite index vs separate indexes?

Use a composite index when queries filter on multiple columns together (e.g., WHERE user_id = 5 AND status = 'active'). The order matters: put the most selective column first. Separate indexes are better when queries use different column combinations.

How do I know if an index is actually being used?

Run EXPLAIN ANALYZE on your query and look for 'Index Scan' or 'Index Only Scan' in the plan. You can also query pg_stat_user_indexes to find indexes with idx_scan = 0, which are never used and should be dropped.

Why is my index not being used even though I created it?

Common causes include: implicit type conversion (comparing string to integer), functions in WHERE clause (LOWER(email) = 'x'), LIKE with leading wildcard ('%abc'), or outdated statistics. Run ANALYZE to update statistics and ensure your WHERE clause matches the index definition exactly.

How often should I rebuild indexes and run VACUUM?

Run ANALYZE regularly (daily for active databases) to update statistics. VACUUM should run automatically via autovacuum in most setups. Only use VACUUM FULL or REINDEX when you have significant bloat or corruption, as they lock tables.

Full instructions (SKILL.md)

Source of truth, from wshobson/agents.


name: sql-optimization-patterns description: Master SQL query optimization, indexing strategies, and EXPLAIN analysis to dramatically improve database performance and eliminate slow queries. Use when debugging slow queries, designing database schemas, or optimizing application performance.

SQL Optimization Patterns

Transform slow database queries into lightning-fast operations through systematic optimization, proper indexing, and query plan analysis.

When to Use This Skill

  • Debugging slow-running queries
  • Designing performant database schemas
  • Optimizing application response times
  • Reducing database load and costs
  • Improving scalability for growing datasets
  • Analyzing EXPLAIN query plans
  • Implementing efficient indexes
  • Resolving N+1 query problems

Core Concepts

1. Query Execution Plans (EXPLAIN)

Understanding EXPLAIN output is fundamental to optimization.

PostgreSQL EXPLAIN:

-- Basic explain
EXPLAIN SELECT * FROM users WHERE email = 'user@example.com';

-- With actual execution stats
EXPLAIN ANALYZE
SELECT * FROM users WHERE email = 'user@example.com';

-- Verbose output with more details
EXPLAIN (ANALYZE, BUFFERS, VERBOSE)
SELECT u.*, o.order_total
FROM users u
JOIN orders o ON u.id = o.user_id
WHERE u.created_at > NOW() - INTERVAL '30 days';

Key Metrics to Watch:

  • Seq Scan: Full table scan (usually slow for large tables)
  • Index Scan: Using index (good)
  • Index Only Scan: Using index without touching table (best)
  • Nested Loop: Join method (okay for small datasets)
  • Hash Join: Join method (good for larger datasets)
  • Merge Join: Join method (good for sorted data)
  • Cost: Estimated query cost (lower is better)
  • Rows: Estimated rows returned
  • Actual Time: Real execution time

2. Index Strategies

Indexes are the most powerful optimization tool.

Index Types:

  • B-Tree: Default, good for equality and range queries
  • Hash: Only for equality (=) comparisons
  • GIN: Full-text search, array queries, JSONB
  • GiST: Geometric data, full-text search
  • BRIN: Block Range INdex for very large tables with correlation
-- Standard B-Tree index
CREATE INDEX idx_users_email ON users(email);

-- Composite index (order matters!)
CREATE INDEX idx_orders_user_status ON orders(user_id, status);

-- Partial index (index subset of rows)
CREATE INDEX idx_active_users ON users(email)
WHERE status = 'active';

-- Expression index
CREATE INDEX idx_users_lower_email ON users(LOWER(email));

-- Covering index (include additional columns)
CREATE INDEX idx_users_email_covering ON users(email)
INCLUDE (name, created_at);

-- Full-text search index
CREATE INDEX idx_posts_search ON posts
USING GIN(to_tsvector('english', title || ' ' || body));

-- JSONB index
CREATE INDEX idx_metadata ON events USING GIN(metadata);

3. Query Optimization Patterns

Avoid SELECT *:

-- Bad: Fetches unnecessary columns
SELECT * FROM users WHERE id = 123;

-- Good: Fetch only what you need
SELECT id, email, name FROM users WHERE id = 123;

Use WHERE Clause Efficiently:

-- Bad: Function prevents index usage
SELECT * FROM users WHERE LOWER(email) = 'user@example.com';

-- Good: Create functional index or use exact match
CREATE INDEX idx_users_email_lower ON users(LOWER(email));
-- Then:
SELECT * FROM users WHERE LOWER(email) = 'user@example.com';

-- Or store normalized data
SELECT * FROM users WHERE email = 'user@example.com';

Optimize JOINs:

-- Bad: Cartesian product then filter
SELECT u.name, o.total
FROM users u, orders o
WHERE u.id = o.user_id AND u.created_at > '2024-01-01';

-- Good: Filter before join
SELECT u.name, o.total
FROM users u
JOIN orders o ON u.id = o.user_id
WHERE u.created_at > '2024-01-01';

-- Better: Filter both tables
SELECT u.name, o.total
FROM (SELECT * FROM users WHERE created_at > '2024-01-01') u
JOIN orders o ON u.id = o.user_id;

Detailed patterns and worked examples

Detailed pattern documentation lives in references/details.md. Read that file when the navigation tier above is insufficient.

Best Practices

  1. Index Selectively: Too many indexes slow down writes
  2. Monitor Query Performance: Use slow query logs
  3. Keep Statistics Updated: Run ANALYZE regularly
  4. Use Appropriate Data Types: Smaller types = better performance
  5. Normalize Thoughtfully: Balance normalization vs performance
  6. Cache Frequently Accessed Data: Use application-level caching
  7. Connection Pooling: Reuse database connections
  8. Regular Maintenance: VACUUM, ANALYZE, rebuild indexes
-- Update statistics
ANALYZE users;
ANALYZE VERBOSE orders;

-- Vacuum (PostgreSQL)
VACUUM ANALYZE users;
VACUUM FULL users;  -- Reclaim space (locks table)

-- Reindex
REINDEX INDEX idx_users_email;
REINDEX TABLE users;

Common Pitfalls

  • Over-Indexing: Each index slows down INSERT/UPDATE/DELETE
  • Unused Indexes: Waste space and slow writes
  • Missing Indexes: Slow queries, full table scans
  • Implicit Type Conversion: Prevents index usage
  • OR Conditions: Can't use indexes efficiently
  • LIKE with Leading Wildcard: LIKE '%abc' can't use index
  • Function in WHERE: Prevents index usage unless functional index exists

Monitoring Queries

-- Find slow queries (PostgreSQL)
SELECT query, calls, total_time, mean_time
FROM pg_stat_statements
ORDER BY mean_time DESC
LIMIT 10;

-- Find missing indexes (PostgreSQL)
SELECT
    schemaname,
    tablename,
    seq_scan,
    seq_tup_read,
    idx_scan,
    seq_tup_read / seq_scan AS avg_seq_tup_read
FROM pg_stat_user_tables
WHERE seq_scan > 0
ORDER BY seq_tup_read DESC
LIMIT 10;

-- Find unused indexes (PostgreSQL)
SELECT
    schemaname,
    tablename,
    indexname,
    idx_scan,
    idx_tup_read,
    idx_tup_fetch
FROM pg_stat_user_indexes
WHERE idx_scan = 0
ORDER BY pg_relation_size(indexrelid) DESC;