Tools & frameworks
The libraries and platforms we reach for at Ephizen — and the ones we keep watching. Each page covers what it does, when to use it, and the gotchas worth knowing before you ship.
Cache / In-memory 1
Databases 1
Deep Learning Frameworks 4
Any deep learning work — training from scratch, fine-tuning, research, or deploying custom models.
Tabular data, prototyping, feature engineering pipelines, and almost any classical ML baseline.
You're maintaining an existing TF codebase, targeting mobile/edge via TFLite, or serving models through TF Serving at scale.
Any supervised learning problem on tabular data — especially classification, regression, and ranking.
Infrastructure 4
You need reproducible environments for training, serving, or local development — especially anything involving CUDA, Python versions, or system libraries.
You run multiple services at nontrivial scale and need rolling deploys, autoscaling, and a uniform way to manage them.
You're exploring a third-party API, debugging your own service, or sharing request collections with teammates who don't live in a terminal.
Validating or serializing structured data in Python — API payloads, configuration, LLM outputs, anything with a schema.
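The core idea behind schema validation can be sketched with the stdlib alone. This is a minimal illustration, not any particular library's API; the `Payload` schema and field names are made up for the example. A dedicated validation library adds what this sketch lacks: type coercion, nested models, and structured error reporting.

```python
from dataclasses import dataclass

@dataclass
class Payload:
    name: str
    count: int

def validate(raw: dict) -> Payload:
    # Check required keys and types before constructing the object.
    for field, typ in [("name", str), ("count", int)]:
        if field not in raw:
            raise ValueError(f"missing field: {field}")
        if not isinstance(raw[field], typ):
            raise ValueError(f"{field} must be {typ.__name__}")
    return Payload(name=raw["name"], count=raw["count"])

print(validate({"name": "job-1", "count": 3}))
```

The same shape covers API payloads, config files, and LLM outputs: parse untrusted input at the boundary, fail loudly, and hand typed objects to the rest of the code.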
LLM & Agent Frameworks 5
You have a well-defined task with examples and want the framework to automatically search over prompts, few-shot demos, and even fine-tunes.
Loading, fine-tuning, or running any pretrained transformer model in Python.
You need a quick LLM application scaffold with ready-made integrations for vector DBs, document loaders, and LLM providers.
You're building an agent with branching logic, retries, checkpoints, or human-in-the-loop steps and want explicit control over the flow.
You're building a RAG system over a corpus of documents and want ready-made loaders, indexers, and query engines.
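The explicit control flow described above (retries, checkpoints, resumable steps) needs nothing framework-specific at its core. Here is a stdlib-only sketch; the step names and the `run_step` stub are illustrative stand-ins for real LLM or tool calls, not any framework's API.

```python
# Minimal sketch of an agent pipeline with explicit retries and checkpoints.
# run_step is a stand-in for an LLM/tool call; "fetch" fails once to show retry.

def run_step(name: str, attempt: int) -> str:
    if name == "fetch" and attempt == 0:
        raise RuntimeError("transient failure")
    return f"{name}:ok"

def run_pipeline(steps: list[str], max_retries: int = 2) -> dict[str, str]:
    checkpoints: dict[str, str] = {}  # step name -> result, so a rerun can resume
    for name in steps:
        for attempt in range(max_retries + 1):
            try:
                checkpoints[name] = run_step(name, attempt)
                break
            except RuntimeError:
                if attempt == max_retries:
                    raise  # give up and surface the checkpoint state so far
    return checkpoints

print(run_pipeline(["plan", "fetch", "answer"]))
```

Agent frameworks layer persistence, branching graphs, and human-in-the-loop pauses on top of exactly this loop; the value they add is making that state explicit and inspectable.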
MLOps 3
Debugging, evaluating, and monitoring LLM chains or agents in dev and production — especially if you're already on LangChain/LangGraph.
You need experiment tracking and a model registry without buying a full MLOps platform, and you want something self-hostable.
You have a RAG pipeline and need quantitative metrics beyond eyeballing outputs, especially for regression testing and comparison.
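One of the simplest quantitative metrics for a RAG pipeline is retrieval recall@k: of the documents you know to be relevant, how many made it into the top-k results? A stdlib sketch (the document IDs are illustrative):

```python
def recall_at_k(retrieved: list[str], relevant: set[str], k: int) -> float:
    # Fraction of known-relevant documents appearing in the top-k retrieved list.
    if not relevant:
        return 0.0
    hits = sum(1 for doc_id in retrieved[:k] if doc_id in relevant)
    return hits / len(relevant)

print(recall_at_k(["d1", "d7", "d3"], {"d1", "d3", "d9"}, k=3))  # 2 of 3 hit
```

Computed over a fixed evaluation set, a metric like this turns "the answers feel worse" into a number you can regression-test on every index or prompt change.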
API & Serving 3
You need a typed, async Python HTTP service — especially one that serves ML models, proxies LLM calls, or exposes a RAG pipeline.
You want to run an open LLM on your laptop or a small server with zero setup — demos, prototypes, offline work.
You're self-hosting an open-weights LLM and care about throughput, latency, and GPU utilization.
Streaming 1
Vector Databases 4
Prototyping, local development, or small production apps where a lightweight embedded store is enough.
You already run Postgres and your vector workload is modest to medium — tens of millions of vectors, single-digit ms queries.
You need a production vector store and don't want to operate one yourself, especially when you need serverless scaling.
You want an open-source, self-hostable vector store with hybrid (vector + keyword) search out of the box.
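Whichever store you pick, queries ultimately rank by a vector distance; cosine similarity is the most common metric. A stdlib sketch of the computation these engines optimize with approximate-nearest-neighbor indexes:

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    # dot(a, b) / (|a| * |b|): 1.0 = same direction, 0.0 = orthogonal.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

print(cosine_similarity([1.0, 2.0, 0.0], [2.0, 4.0, 0.0]))  # parallel vectors
```

Hybrid search combines a score like this with a keyword score (typically BM25), which is why it helps on queries where exact terms matter more than semantics.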
Visualization 2
You want the fastest path from a Python function to a clickable demo, especially for ML models with image, audio, or text I/O.
You want a quick internal tool or demo around a model and don't want to touch frontend code.
Data Warehouses 2
You're running Spark-scale ETL, training models on terabytes, and want notebook + job + MLflow in one place.
You want a no-ops analytics warehouse with strong governance, concurrent BI workloads, and simple SQL semantics.