- Keep explanations brief and to the point.
- I am mostly familiar with Bash, Stata, Python, and Go.
- Do not provide code snippets or examples unless asked.
- Do not use emojis.
- Prefer Cloudflare Pages and HTMX + Tailwind CSS.
- Avoid npm/webpack/JS builds.
- Cloudflare stack: domains, R2, D1, Workers, Containers, Pages, AI Gateway, Queues, KV, Calls/RealtimeKit.
- Use D1 unless it hits limitations or Postgres-specific features (JSONB, FTS, PostGIS) are needed.
- Cloudflare Workers support Python, but consider whether TypeScript Workers are more reliable for the task.
- When creating R2 buckets, enable Data Catalog explicitly (it is not enabled by default).
- Multiple cloud accounts/domains may exist across providers.
- For low latency, prefer WebRTC (audio/video), WebSockets, SSE, or UDP over REST.
- Use Python 3.12.
- Use uv and uv workspaces, ty (LSP), pytest, ruff, and type hints on all functions.
- FastAPI + SQLModel (Pydantic + SQLAlchemy combined) for APIs and databases; see the SQLModel sketch below.
- Prefer serverless approaches that work in Containers or Workers over virtual machines.
- Always try SQLModel first and fall back to Python dataclasses only if necessary.
- Use exchange_calendars for trading holidays by exchange.
- Concise comments explaining intent. Tests with mocks/fixtures. 4-space indent.
- Be mindful of timezones; Modal.com uses UTC/GMT.
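A minimal sketch of the FastAPI + SQLModel pattern referenced above; the Trade model, its fields, and the SQLite DSN are placeholders, not an existing schema.

```python
# Hypothetical model and endpoints; swap names and DSN for the real project.
from fastapi import FastAPI
from sqlmodel import Field, Session, SQLModel, create_engine, select


class Trade(SQLModel, table=True):
    id: int | None = Field(default=None, primary_key=True)
    symbol: str
    qty: int


engine = create_engine("sqlite:///example.db")
SQLModel.metadata.create_all(engine)
app = FastAPI()


@app.post("/trades")
def create_trade(trade: Trade) -> Trade:
    with Session(engine) as session:
        session.add(trade)
        session.commit()
        session.refresh(trade)
        return trade


@app.get("/trades")
def list_trades() -> list[Trade]:
    with Session(engine) as session:
        return list(session.exec(select(Trade)).all())
```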
- Use gymnasium, not OpenAI gym (deprecated).
- Prefer polars over pandas; use numpy for numerical compute.
- Store tabular data as parquet (zstd compression if available).
- DuckDB for analytical SQL queries on parquet/polars; dbt for SQL transformations (see the parquet/DuckDB sketch below).
- Prefer hardware-accelerated libraries: polars (SIMD), numpy (BLAS/LAPACK), cuDF/RAPIDS (GPU).
- Prefer Prefect for orchestration and workflows over Airflow.
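A short sketch of the preferred flow: transform with polars, store as parquet with zstd, query with DuckDB. File and column names are illustrative.

```python
# Illustrative data; replace with the real dataset.
import duckdb
import polars as pl

df = pl.DataFrame({"symbol": ["AAPL", "MSFT", "AAPL"], "close": [190.5, 410.2, 191.0]})
df.write_parquet("prices.parquet", compression="zstd")

# DuckDB queries the parquet file directly, no load step needed.
con = duckdb.connect()
result = con.sql(
    "SELECT symbol, avg(close) AS avg_close FROM 'prices.parquet' GROUP BY symbol"
).pl()  # back to a polars DataFrame
print(result)
```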
- Prefer async/await for I/O-bound work (HTTP, DB, file); see the pooled-client sketch below.
- Use multiprocessing or ProcessPoolExecutor for CPU-bound work (bypasses the GIL).
- Profile first (cProfile, py-spy, scalene) — don't guess bottlenecks.
- Batch external calls (DB queries, API requests) — N+1 is the common killer.
- Connection pooling for databases and HTTP (httpx, asyncpg, sqlalchemy pool).
- Use generators/iterators for large datasets — avoid materializing full lists.
- Prefer list/dict comprehensions over loops — faster and clearer.
- NumPy/polars vectorization over Python loops for numerical work.
- functools.lru_cache or @cache for expensive pure functions; see the caching/slots sketch below.
- Use slots on dataclasses with many instances (cuts per-instance memory by roughly 40%).
- Avoid repeated attribute lookups in tight loops — assign to local variable.
- f-strings are fastest for string formatting.
- orjson for JSON (roughly 10-20× faster than stdlib json; handles both serialization and parsing).
- pysimdjson only if parsing large (>1MB) JSON is the bottleneck.
- uvloop as event loop for async workloads (2-4× faster than default).
- Struct packing for binary protocols (struct module or msgspec); see the struct/memoryview sketch below.
- Memory views for zero-copy buffer manipulation.
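Pooled-client sketch for the async/batching bullets above: one httpx.AsyncClient reuses connections while asyncio.gather batches the requests. The URLs and timeout are placeholders.

```python
import asyncio

import httpx


async def fetch_all(urls: list[str]) -> list[str]:
    # A single client keeps a connection pool; gather issues the batch concurrently.
    async with httpx.AsyncClient(timeout=10.0) as client:
        responses = await asyncio.gather(*(client.get(u) for u in urls))
        return [r.text for r in responses]


if __name__ == "__main__":
    pages = asyncio.run(fetch_all(["https://example.com", "https://example.org"]))
    print(len(pages))
```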
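Caching/slots sketch: lru_cache memoizes a pure function, and slots=True drops the per-instance __dict__. The Quote class and fib function are illustrations only.

```python
from dataclasses import dataclass
from functools import lru_cache


@dataclass(slots=True)  # no __dict__ per instance, so many instances stay cheap
class Quote:
    symbol: str
    price: float


@lru_cache(maxsize=None)  # safe because fib is pure and its arguments are hashable
def fib(n: int) -> int:
    return n if n < 2 else fib(n - 1) + fib(n - 2)


print(fib(50), Quote("AAPL", 190.5))
```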
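Struct/memoryview sketch: a fixed binary record layout packed with the struct module and read back through a zero-copy memoryview. The record layout (id, price, size) is invented for illustration.

```python
import struct

RECORD = struct.Struct("<IdI")  # uint32 id, float64 price, uint32 size, little-endian

buf = bytearray(RECORD.size * 2)
RECORD.pack_into(buf, 0, 1, 190.5, 100)
RECORD.pack_into(buf, RECORD.size, 2, 410.2, 50)

view = memoryview(buf)  # reading through the view copies nothing
print(RECORD.unpack_from(view, RECORD.size))  # (2, 410.2, 50)
```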
- Use massive.com (formerly Polygon.io). An R2 bucket ('warrenbucket') with historical data already exists.
- GPU service: Modal.com (primary) + RunPod.io (backup).
- Use huggingface.co.
- Vector DB: use Qdrant locally; plan for a possible later sync to Cloudflare Vectorize (see the Qdrant sketch below).
- Use a versioned embedding library that is compatible with both local development and Cloudflare Vectorize.
- Use wandb for logging and debugging language model inputs, outputs, and traces.
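Qdrant sketch for the vector DB bullets above: a local (in-memory) collection that could later be mirrored to Cloudflare Vectorize. The collection name, vector size, and payload are placeholders.

```python
from qdrant_client import QdrantClient
from qdrant_client.models import Distance, PointStruct, VectorParams

client = QdrantClient(":memory:")  # or QdrantClient(url="http://localhost:6333")
client.create_collection(
    collection_name="docs",
    vectors_config=VectorParams(size=4, distance=Distance.COSINE),
)
client.upsert(
    collection_name="docs",
    points=[PointStruct(id=1, vector=[0.1, 0.2, 0.3, 0.4], payload={"source": "note"})],
)
hits = client.search(collection_name="docs", query_vector=[0.1, 0.2, 0.3, 0.4], limit=1)
print(hits)
```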
- Prefer linear probability model over logit; always use robust (HC) standard errors.
- Panel fixed effects: use linearmodels (PanelOLS); it handles clustered SEs and IV better than statsmodels (see the PanelOLS sketch below).
- Mixed effects (statsmodels.MixedLM): hierarchical data with random intercepts/slopes (repeated measures, nested structures like students within schools). Use when group-level variance matters or predicting for new groups.
- Use fixed effects over mixed effects when within-group variation is what identifies causal effects.
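PanelOLS sketch for the fixed-effects bullets above: entity fixed effects with standard errors clustered by entity. The panel is simulated, and pandas is used because linearmodels expects an (entity, time) MultiIndex.

```python
import numpy as np
import pandas as pd
from linearmodels.panel import PanelOLS

rng = np.random.default_rng(0)
idx = pd.MultiIndex.from_product([range(50), range(10)], names=["firm", "year"])
df = pd.DataFrame({"x": rng.normal(size=500)}, index=idx)
df["y"] = 0.5 * df["x"] + rng.normal(size=500)

# EntityEffects absorbs firm fixed effects; SEs are clustered by firm.
res = PanelOLS.from_formula("y ~ x + EntityEffects", data=df).fit(
    cov_type="clustered", cluster_entity=True
)
print(res.summary)
```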