SkillsNav
Home

Data

58 skills · sorted by GitHub stars

networkx
NetworkX is a Python package for creating, manipulating, and analyzing complex networks and graphs.
★ 17K repodata
sympy
SymPy is a Python library for symbolic mathematics that enables exact computation using mathematical
★ 14K repodata
astropy
Astropy is the core Python package for astronomy, providing essential functionality for astronomical
★ 5.2K repodata
biopython
Biopython is a comprehensive set of freely available Python tools for biological computation. It pro
★ 5.1K repodata
using-neon
Neon is a serverless Postgres platform that separates compute and storage to offer autoscaling, bran
★ 69 repodata
data-structure-protocol
Give agents persistent structural memory of a codebase — navigate dependencies, track public APIs, a
★ 52 repodata
alpha-vantage
Access 20+ years of global financial data: equities, options, forex, crypto, commodities, economic i
data
amplitude-automation
Automate Amplitude tasks via Rube MCP (Composio): events, user activity, cohorts, user identificatio
data
analytics-product
Analytics de produto — PostHog, Mixpanel, eventos, funnels, cohorts, retencao, north star metric, OK
data
analytics-tracking
Design, audit, and improve analytics tracking systems that produce reliable, decision-ready data.
data
base
Database management, forms, reports, and data operations with LibreOffice Base.
data
cirq
Cirq is Google Quantum AI's open-source framework for designing, simulating, and running quantum cir
data
claimable-postgres
Provision instant temporary Postgres databases via Claimable Postgres by Neon (pg.new). No login or
data
data-engineer
Build scalable data pipelines, modern data warehouses, and real-time streaming architectures. Implem
data
data-engineering-data-driven-feature
Build features guided by data insights, A/B testing, and continuous measurement using specialized ag
data
data-engineering-data-pipeline
You are a data pipeline architecture expert specializing in scalable, reliable, and cost-effective d
data
data-quality-frameworks
Implement data quality validation with Great Expectations, dbt tests, and data contracts. Use when b
data
data-scientist
Expert data scientist for advanced analytics, machine learning, and statistical modeling. Handles co
data
data-storytelling
Transform raw data into compelling narratives that drive decisions and inspire action.
data
database-admin
Expert database administrator specializing in modern cloud databases, automation, and reliability en
data
database-architect
Expert database architect specializing in data layer design from scratch, technology selection, sche
data
database-cloud-optimization-cost-optimize
You are a cloud cost optimization expert specializing in reducing infrastructure expenses while main
data
database-design
Database design principles and decision-making. Schema design, indexing strategy, ORM selection, ser
data
database-migration
Master database schema and data migrations across ORMs (Sequelize, TypeORM, Prisma), including rollb
data
database-migrations-migration-observability
Migration monitoring, CDC, and observability infrastructure
data
database-migrations-sql-migrations
SQL database migrations with zero-downtime strategies for PostgreSQL, MySQL, and SQL Server. Focus o
data
database-optimizer
Expert database optimizer specializing in modern performance tuning, query optimization, and scalabl
data
dbt-transformation-patterns
Production-ready patterns for dbt (data build tool) including model organization, testing strategies
data
drizzle-orm-expert
Expert in Drizzle ORM for TypeScript — schema design, relational queries, migrations, and serverless
data
firecrawl-scraper
Deep web scraping, screenshots, PDF parsing, and website crawling using Firecrawl API. Use when you
data
matplotlib
Matplotlib is Python's foundational visualization library for creating static, animated, and interac
data
mixpanel-automation
Automate Mixpanel tasks via Rube MCP (Composio): events, segmentation, funnels, cohorts, user profil
data
monte-carlo-monitor-creation
Guides creation of Monte Carlo monitors via MCP tools, producing monitors-as-code YAML for CI/CD dep
data
monte-carlo-prevent
Surfaces Monte Carlo data observability context (table health, alerts, lineage, blast radius) before
data
monte-carlo-push-ingestion
Expert guide for pushing metadata, lineage, and query logs to Monte Carlo from any data warehouse.
data
monte-carlo-validation-notebook
Generates SQL validation notebooks for dbt PR changes with before/after comparison queries.
data
neon-postgres
Expert patterns for Neon serverless Postgres, branching, connection pooling, and Prisma/Drizzle inte
data
nosql-expert
Expert guidance for distributed NoSQL databases (Cassandra, DynamoDB). Focuses on mental models, que
data
plotly
Interactive visualization library. Use when you need hover info, zoom, pan, or web-embeddable charts
data
polars
Fast in-memory DataFrame library for datasets that fit in RAM. Use when pandas is too slow but data
data
postgres-best-practices
Postgres performance optimization and best practices from Supabase. Use this skill when writing, rev
data
postgresql
Design a PostgreSQL-specific schema. Covers best-practices, data types, indexing, constraints, perfo
data
posthog-automation
Automate PostHog tasks via Rube MCP (Composio): events, feature flags, projects, user profiles, anno
data
prisma-expert
You are an expert in Prisma ORM with deep knowledge of schema design, migrations, query optimization
data
qiskit
Qiskit is the world's most popular open-source quantum computing framework with 13M+ downloads. Buil
data
saas-multi-tenant
Design and implement multi-tenant SaaS architectures with row-level security, tenant-scoped queries,
data
scanpy
Scanpy is a scalable Python toolkit for analyzing single-cell RNA-seq data, built on AnnData. Apply
data
seaborn
Seaborn is a Python visualization library for creating publication-quality statistical graphics. Use
data
segment-automation
Automate Segment tasks via Rube MCP (Composio): track events, identify users, manage groups, page vi
data
segment-cdp
Expert patterns for Segment Customer Data Platform including Analytics.js, server-side tracking, tra
data