Data
58 skills · sorted by GitHub stars
networkx
NetworkX is a Python package for creating, manipulating, and analyzing complex networks and graphs.
sympy
SymPy is a Python library for symbolic mathematics that enables exact computation using mathematical
astropy
Astropy is the core Python package for astronomy, providing essential functionality for astronomical
biopython
Biopython is a comprehensive set of freely available Python tools for biological computation. It pro
using-neon
Neon is a serverless Postgres platform that separates compute and storage to offer autoscaling, bran
data-structure-protocol
Give agents persistent structural memory of a codebase — navigate dependencies, track public APIs, a
alpha-vantage
Access 20+ years of global financial data: equities, options, forex, crypto, commodities, economic i
amplitude-automation
Automate Amplitude tasks via Rube MCP (Composio): events, user activity, cohorts, user identificatio
analytics-product
Product analytics: PostHog, Mixpanel, events, funnels, cohorts, retention, north star metric, OK
analytics-tracking
Design, audit, and improve analytics tracking systems that produce reliable, decision-ready data.
base
Database management, forms, reports, and data operations with LibreOffice Base.
cirq
Cirq is Google Quantum AI's open-source framework for designing, simulating, and running quantum cir
claimable-postgres
Provision instant temporary Postgres databases via Claimable Postgres by Neon (pg.new). No login or
data-engineer
Build scalable data pipelines, modern data warehouses, and real-time streaming architectures. Implem
data-engineering-data-driven-feature
Build features guided by data insights, A/B testing, and continuous measurement using specialized ag
data-engineering-data-pipeline
You are a data pipeline architecture expert specializing in scalable, reliable, and cost-effective d
data-quality-frameworks
Implement data quality validation with Great Expectations, dbt tests, and data contracts. Use when b
data-scientist
Expert data scientist for advanced analytics, machine learning, and statistical modeling. Handles co
data-storytelling
Transform raw data into compelling narratives that drive decisions and inspire action.
database-admin
Expert database administrator specializing in modern cloud databases, automation, and reliability en
database-architect
Expert database architect specializing in data layer design from scratch, technology selection, sche
database-cloud-optimization-cost-optimize
You are a cloud cost optimization expert specializing in reducing infrastructure expenses while main
database-design
Database design principles and decision-making. Schema design, indexing strategy, ORM selection, ser
database-migration
Master database schema and data migrations across ORMs (Sequelize, TypeORM, Prisma), including rollb
database-migrations-migration-observability
Migration monitoring, CDC, and observability infrastructure
database-migrations-sql-migrations
SQL database migrations with zero-downtime strategies for PostgreSQL, MySQL, and SQL Server. Focus o
database-optimizer
Expert database optimizer specializing in modern performance tuning, query optimization, and scalabl
dbt-transformation-patterns
Production-ready patterns for dbt (data build tool) including model organization, testing strategies
drizzle-orm-expert
Expert in Drizzle ORM for TypeScript — schema design, relational queries, migrations, and serverless
firecrawl-scraper
Deep web scraping, screenshots, PDF parsing, and website crawling using Firecrawl API. Use when you
matplotlib
Matplotlib is Python's foundational visualization library for creating static, animated, and interac
mixpanel-automation
Automate Mixpanel tasks via Rube MCP (Composio): events, segmentation, funnels, cohorts, user profil
monte-carlo-monitor-creation
Guides creation of Monte Carlo monitors via MCP tools, producing monitors-as-code YAML for CI/CD dep
monte-carlo-prevent
Surfaces Monte Carlo data observability context (table health, alerts, lineage, blast radius) before
monte-carlo-push-ingestion
Expert guide for pushing metadata, lineage, and query logs to Monte Carlo from any data warehouse.
monte-carlo-validation-notebook
Generates SQL validation notebooks for dbt PR changes with before/after comparison queries.
neon-postgres
Expert patterns for Neon serverless Postgres, branching, connection pooling, and Prisma/Drizzle inte
nosql-expert
Expert guidance for distributed NoSQL databases (Cassandra, DynamoDB). Focuses on mental models, que
plotly
Interactive visualization library. Use when you need hover info, zoom, pan, or web-embeddable charts
polars
Fast in-memory DataFrame library for datasets that fit in RAM. Use when pandas is too slow but data
postgres-best-practices
Postgres performance optimization and best practices from Supabase. Use this skill when writing, rev
postgresql
Design a PostgreSQL-specific schema. Covers best-practices, data types, indexing, constraints, perfo
posthog-automation
Automate PostHog tasks via Rube MCP (Composio): events, feature flags, projects, user profiles, anno
prisma-expert
You are an expert in Prisma ORM with deep knowledge of schema design, migrations, query optimization
qiskit
Qiskit is the world's most popular open-source quantum computing framework with 13M+ downloads. Buil
saas-multi-tenant
Design and implement multi-tenant SaaS architectures with row-level security, tenant-scoped queries,
scanpy
Scanpy is a scalable Python toolkit for analyzing single-cell RNA-seq data, built on AnnData. Apply
seaborn
Seaborn is a Python visualization library for creating publication-quality statistical graphics. Use
segment-automation
Automate Segment tasks via Rube MCP (Composio): track events, identify users, manage groups, page vi
segment-cdp
Expert patterns for Segment Customer Data Platform including Analytics.js, server-side tracking, tra