Deployment options for LangGraph workflows

LangGraph workflows can be deployed through several paths. The LangGraph Platform is a managed service that handles the infrastructure for running, scaling, and monitoring LangGraph applications — it provides an API server, a database for state persistence, and a deployment pipeline. Self-hosted deployment runs the LangGraph application in a container or on a server with the application code and a compatible database for state storage; this option gives more control over infrastructure and data residency at the cost of operational responsibility. For workflows without long-horizon state requirements, lightweight deployments using standard Python web frameworks are also viable if the use case fits a request-response model rather than a stateful long-running workflow.

State persistence in production

LangGraph's checkpointing system requires a persistent state store in production. The default in-memory saver is appropriate for development and testing but loses all workflow state when the process restarts. Production deployments configure an external store — a relational database, a key-value store, or the LangGraph Platform's managed storage — so that interrupted workflows can resume, human-in-the-loop checkpoints persist between interactions, and long-running workflows survive process restarts or scaling events. The state schema must be designed with persistence in mind: all state values need to be serializable to the storage backend, and state migrations need to be managed when the schema changes across deployments.

Streaming, scaling, and monitoring

LangGraph workflows support streaming intermediate outputs during execution, which allows clients to observe agent progress in real time rather than waiting for final results. Configuring streaming correctly in production requires handling the streaming transport at both the server and client level. Scaling LangGraph deployments involves managing concurrency: multiple simultaneous workflow executions require either stateless scaling (if state is fully externalized) or workflow-level routing to ensure each execution touches the same state store. Monitoring production LangGraph deployments requires capturing workflow-level metrics — execution duration, step counts, error rates by node, checkpoint frequency — in addition to standard infrastructure metrics.