<![CDATA[Multiplayer Blog]]>https://www.multiplayer.app/blog/https://www.multiplayer.app/blog/favicon.pngMultiplayer Bloghttps://www.multiplayer.app/blog/Ghost 6.6Thu, 26 Feb 2026 17:45:25 GMT60<![CDATA[The (not so) hidden cost of custom logging]]>https://www.multiplayer.app/blog/the-not-so-hidden-cost-of-custom-logging/69a01b437d751e9a8625896aThu, 26 Feb 2026 11:18:43 GMT

If you're a technical leader, there's a good chance your team is spending significant time on custom logging… and you might not even realize how much it's costing you in lost productivity and incomplete debugging data.

Let's start with a simple test. When a production incident hits, can your team immediately answer these (or similar) questions?

Question | With Custom Logging | With Multiplayer
What did we send to Stripe? | 🟡 Maybe (if someone logged it) | ✅ Always
What did Stripe return? | 🔴 Probably not (response often not logged) | ✅ Always
What about that Twilio call? | 🔴 Likely missing (one-off integration) | ✅ Captured
What about the AWS S3 upload? | 🔴 Almost certainly not logged | ✅ Captured
Can I see the full request chain? | 🔴 Scattered across logs, traces, dashboards | ✅ One timeline

If you're seeing mostly yellow and red in the custom logging column, your team has a custom logging problem. And it's costing you more than you think.

What is custom logging (and why do teams do it)?

Custom logging is when developers manually add code to capture and record information about what's happening in your application: specifically, the request and response data flowing through your system.

Here's what it looks like in practice:

import logging

import stripe

logger = logging.getLogger(__name__)

# Before calling the Stripe API
logger.info("Calling Stripe charge API", extra={
    "amount": 1000,
    "currency": "usd",
    "customer_id": "cus_123"
})

# Make the API call
response = stripe.Charge.create(amount=1000, currency="usd", customer="cus_123")

# After getting the response
logger.info("Stripe response received", extra={
    "charge_id": response.id,
    "status": response.status,
    "response": response.to_dict()
})

Standard observability tools (Datadog, New Relic, Honeycomb, etc.) capture that API calls happened. You can see "POST to Stripe, 200ms, success", but they don't capture what data was in those calls. To understand what actually went wrong during a bug, you need the request and response content: what you sent, what came back, and where it broke.

So teams add custom logging everywhere: in checkout flows, payment processing, email sending, file uploads, database queries. Anywhere they might need to debug later.

The problem: it requires constant manual discipline

Custom logging sounds reasonable in theory. Just add logging when you write the code, right? But here's what actually happens:

"I'll add it later" (never happens)

During a rushed bug fix or feature launch, logging gets deprioritized. "Let's just ship this, we'll add proper logging in the next sprint." The next sprint comes with its own priorities. The logging never gets added.

One-off integrations get skipped

Your team integrates a new third-party service, maybe a document signing API or a geocoding service. The main flow gets logged, but edge cases and error paths don't. Six months later, an issue happens in one of those paths, and there's no data to debug with.

New developers forget

A junior developer ships a new feature. They test it, it works, they move on. Three weeks later, that feature breaks in production. The team scrambles to debug it and realizes: there's no logging. No one can see what data the user submitted or what the external API returned.

Response logging is inconsistent

Even when developers remember to log the request to an external API, they often forget to log the response. Or they log success cases but not errors. Or vice versa. During an incident, you're left guessing what the external service actually returned.
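In code, the gap usually looks like this (a hypothetical sketch of the pattern, not any specific codebase):

import logging

import stripe

logger = logging.getLogger(__name__)

def charge_customer(amount, currency, customer_id):
    # The request is logged...
    logger.info("Calling Stripe charge API", extra={
        "amount": amount, "currency": currency, "customer_id": customer_id
    })
    try:
        return stripe.Charge.create(
            amount=amount, currency=currency, customer=customer_id
        )
    except stripe.error.StripeError:
        # ...but the error path logs nothing about what Stripe actually
        # returned, which is exactly what you need during an incident.
        raise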

The hidden cost: time and incomplete coverage

Here's what custom logging looks like in practice:

Area | Coverage | Why
Internal services | 🟡 80-90% | Middleware helps, but new endpoints slip through
External APIs | 🔴 40-60% | Manual wrapping for each call, easy to forget
Setup & maintenance | 🔴 High | Hours to weeks to set everything up, and it breaks if not maintained

When custom logging coverage is incomplete, here's what happens during incidents:

  • Longer debugging time: Engineers spend hours hunting for data that doesn't exist, context-switching between tools, trying to reproduce issues locally.
  • Ongoing maintenance burden: Every new feature, every new integration, every new external API requires thinking about logging. It's a tax on velocity.
  • Friction in code reviews: "Did you add logging for this?" becomes a frequent review comment that slows down reviews. Teams try to enforce it through checklists and linters, but it's never perfect.
  • False confidence: Leadership sees "we have logging" in the observability platform and assumes the team can debug any issue. The reality only becomes clear during incidents when critical data is missing.

How Multiplayer is different

Multiplayer takes a fundamentally different approach: automatic capture instead of manual instrumentation.

Instead of requiring developers to remember to log every API call, Multiplayer automatically captures:

  • All internal service requests and responses (full content and headers)
  • All external API calls (Stripe, Twilio, AWS, etc.) with full request/response data
  • Frontend user actions and network activity
  • Everything correlated in one timeline

With Multiplayer:

Area | Coverage | Why
Internal services | 🟢 100% | Automatic capture, SDK instrumentation
External APIs | 🟢 100% | Automatic capture, no manual wrapping
Setup & maintenance | 🟢 100% | Works automatically, doesn't degrade over time

The session-based difference

The other key difference: Multiplayer uses session-based recording instead of "always-on for all traffic."

Traditional approach (custom logging + observability):

  • Collect logs and traces for 100% of traffic (or sample heavily to control costs)
  • Store everything forever
  • Pay for massive volumes of data you'll never look at
  • Still missing the payloads you actually need, because you didn't manually instrument for them

Multiplayer approach:

  • Only record sessions where you need visibility (user-reported bugs, errors, specific conditions)
  • When you do record, capture everything for that session
  • Lower cost because you're not ingesting terabytes of generic logs

This means you get complete data where it matters without the cost of logging everything, everywhere, always.
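As a toy illustration of that model (hypothetical logic, not the actual Multiplayer SDK), session-based capture amounts to: buffer everything during a session, persist it only when a trigger condition fires.

import contextlib

def store(events):
    # Stand-in for persisting a complete, correlated session.
    print(f"storing {len(events)} events")

class SessionRecorder:
    def __init__(self):
        self.events = []

    def capture(self, event):
        # During the session, buffer full-fidelity data:
        # payloads, traces, frontend actions.
        self.events.append(event)

    @contextlib.contextmanager
    def session(self, keep_if):
        self.events = []
        try:
            yield self
        finally:
            if keep_if(self.events):
                store(self.events)  # keep everything for this session
            self.events = []        # otherwise nothing is ingested or stored

# Keep the session only if it contains an error.
recorder = SessionRecorder()
with recorder.session(keep_if=lambda evs: any(e.get("error") for e in evs)):
    recorder.capture({"type": "click", "target": "checkout"})
    recorder.capture({"type": "http", "status": 500, "error": True})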

What this means for technical leaders

If you're responsible for engineering productivity and infrastructure costs, here are the questions to ask:

  1. What percentage of our external API calls are actually logged? Ask your team to audit. The answer is usually surprising (and lower than expected).
  2. How much time do engineers spend adding logging to new features? This is a hidden tax on velocity. Every feature requires thinking about and implementing logging.
  3. During the last 5 incidents, how often did we have all the data we needed? "We had to reproduce it locally" or "we couldn't see what Stripe returned" are red flags.
  4. What's our monthly cost for log ingestion and storage? Custom logging creates massive volumes. Even with sampling, the costs add up.
  5. How do junior developers learn what to log and where? If the answer is "code reviews catch it" or "tribal knowledge," coverage will stay incomplete.

The bottom line

Custom logging can technically capture everything, but in practice, it rarely does. Coverage degrades over time, external APIs get forgotten, and during incidents, you're left asking "did anyone log this?" instead of debugging.

Automatic capture solves this by ensuring that when an issue happens, you have the complete context (frontend, backend, external APIs, request / response content and headers) already correlated and ready to investigate.

Your team stops spending time on logging instrumentation and starts spending time on what actually matters: understanding and fixing bugs.


GETTING STARTED WITH MULTIPLAYER

👀 If this is the first time you’ve heard about Multiplayer, you may want to see full stack session recordings in action. You can do that in our free sandbox: sandbox.multiplayer.app

If you’re ready to trial Multiplayer you can start a free plan at any time 👇

Start a free plan
]]>
<![CDATA[Why observability tools are missing critical debugging data (no matter how you sample)]]>https://www.multiplayer.app/blog/why-observability-tools-are-missing-critical-debugging-data-no-matter-how-you-sample/699868ca0c9964123782285aFri, 20 Feb 2026 15:10:29 GMT

There's a common belief in the observability space: if you just collect more data, you'll have what you need to debug any issue.

The reality is more frustrating: even with 100% unsampled observability, you're still missing critical debugging data.

The problem isn't sampling. It's what observability tools are designed to capture in the first place.

What observability tools actually collect

Traditional observability platforms (Datadog, New Relic, Dynatrace, Grafana, etc.) are excellent at their core job: monitoring distributed systems. They collect:

  • Traces: The path a request takes through your services
  • Logs: Events and errors from your applications
  • Metrics: System health, latency, error rates

This gives you visibility that something happened and where it happened. You can see that Service A called Service B, that the call took 250ms, that it returned a 500 error, and that the error rate spiked at 14:23 UTC.

For traditional monitoring and alerting, this is great. But for debugging complex issues, especially if you’re leveraging AI coding assistants, this data is incomplete.

The three data gaps that sampling doesn't fix

Gap 1: Request/response payloads for internal services

Observability tools capture HTTP metadata (method, URL, status code, response time) and trace headers for correlation. However, they don't capture the actual request and response bodies: the JSON payloads, form data, or API responses your services exchange.

Why this matters:

When a checkout fails with a 400 error, you can see that the /api/checkout endpoint failed. But you can't see:

  • What data was in the checkout request (cart contents, payment method, shipping address)
  • What validation error was returned in the response body
  • What custom headers were passed (tenant ID, feature flags, API version)

The sampling myth:

Making your traces unsampled (capturing 100% instead of 1-10%) doesn't solve this. You still only get metadata, not payloads. You're just getting metadata for every request instead of a sampled subset of requests.

How teams work around this:

Custom logging. You manually instrument your code to log payloads:

logger.info("Checkout request", extra={"body": request.json()})

This works, but:

  • Requires instrumentation in every service
  • Creates massive log volumes (and costs)
  • Lives in a separate tool from your traces
  • Requires manual correlation with your other data
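And "instrumentation in every service" is not a one-liner. Each service ends up carrying a hook like this Flask-style sketch (hypothetical; redaction and payload size limits omitted):

import logging

from flask import Flask, request

app = Flask(__name__)
logger = logging.getLogger(__name__)

@app.before_request
def log_request_body():
    # Captures the raw request body for every endpoint in this service only;
    # every other service needs its own copy of this code.
    logger.info("request", extra={
        "path": request.path,
        "body": request.get_data(as_text=True),
    })

@app.after_request
def log_response_body(response):
    # Response bodies too; real deployments also need redaction and size caps.
    logger.info("response", extra={
        "path": request.path,
        "body": response.get_data(as_text=True),
    })
    return response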

Gap 2: External API calls are black boxes

Your observability tools can tell you that your backend called Stripe, Twilio, or AWS, how many milliseconds the call took, and whether it succeeded or failed. However, they don't capture:

  • What you sent to the external API
  • What the external API returned
  • Error details from the external service

Why this matters:

A payment fails. Your trace shows:

POST /api/checkout → calls Stripe → 400 error → returns error to user

But you can't see:

  • The payment payload you sent to Stripe (amount, currency, customer ID)
  • Stripe's error response ("Currency mismatch: customer configured for EUR, request sent USD")
  • Whether retry logic was triggered

The sampling myth:

Unsampled observability still sees external APIs as opaque boxes. Distributed tracing stops at your system boundary. To see what Stripe returned, you need to:

  1. Log into Stripe's dashboard
  2. Search by timestamp or customer ID
  3. Manually correlate with your trace

How teams work around this:

Manual logging before and after every external call:

logger.info("Calling Stripe", extra={"payload": stripe_data})
response = stripe.Charge.create(**stripe_data)
logger.info("Stripe response", extra={"response": response.to_dict()})

This is tedious, error-prone, and easy to forget, especially for one-off integrations.

Gap 3: Frontend context is disconnected

Many observability tools focus only on backend traces and logs. However, some also have RUM / session replay functionalities in their toolkit (e.g. Datadog RUM) or integrations with third-party frontend tools (e.g. Sentry, LogRocket, etc.).

Ultimately, however, obtaining full stack visibility is not as simple as it may seem: it requires additional instrumentation, possibly a separate subscription, and, oftentimes, manual ID correlation.

In short, you don’t always get an automatic correlation between "what the user did" and "what happened in the backend."

Why this matters:

A user reports: "Checkout is broken." You need to:

  1. Get the timestamp from the session replay
  2. Find the corresponding backend trace by matching timestamps (hoping there's no clock skew)
  3. Correlate the request ID if it was propagated (often it wasn't)

The sampling myth:

Even with 100% unsampled backend traces and full frontend session capture, correlation is still manual. The tools don't automatically link "this user session" to "these backend traces" unless you've explicitly propagated session IDs everywhere and configured correlation across platforms.
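Sketched in Python, that manual propagation looks something like this (the header name, endpoint, and wiring are illustrative):

import logging

import requests

logger = logging.getLogger(__name__)
SESSION_HEADER = "X-Session-Id"  # illustrative header name

def handle_checkout(incoming_headers):
    session_id = incoming_headers.get(SESSION_HEADER, "unknown")
    # Tag every log line with the ID so backend data can later be matched
    # (manually, in another tool) to the frontend session...
    logger.info("checkout started", extra={"session_id": session_id})
    # ...and forward the ID so downstream services can do the same.
    return requests.post(
        "https://inventory.internal/reserve",
        headers={SESSION_HEADER: session_id},
        json={"sku": "sku_123"},
        timeout=5,
    )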

The complexity trap

Here's what happens when teams try to solve these gaps with traditional observability tools:

Level 1: Observability (sampled)

  • Cost: $ (manageable)
  • Traces & logs: 1-10% sampled
  • Internal payloads: ❌ By default, you capture only metadata about the request (method, URL, status)
  • External APIs: ❌ Black boxes
  • Frontend: ❌ No visibility
  • Correlation: ❌ No

Level 2: Observability + RUM


The main differences are:

  • Cost: $$ (increased due to more tools and instrumentation overhead)
  • Frontend: ✅ Visual replay, console logs, browser network, etc.
  • Correlation: ⚠️ Manual setup required to propagate session IDs, configure tools, etc.

Level 3: Unsampled observability + RUM


The main differences are:

  • Cost: $$$ (high, you’re ingesting and storing 10-100x more data)
  • Traces & logs: ✅ 100% captured

Level 4: Unsampled observability + RUM + custom payload logging


The main differences are:

  • Cost: $$$$ (very high: now logging full payloads for everything)
  • Internal payloads: ✅ Available (if you instrumented it)
  • External APIs: ⚠️ Available (if you manually logged every call)

Notice the pattern: complexity and cost increase, but fundamental gaps remain.

What's needed instead: auto-correlated, session-based capture

The problem with traditional observability is the collection model itself: "collect everything, all the time, hope you have what you need later."

There’s a different approach:

  1. Collect on-demand: Don't record every request. Record specific sessions where you need visibility (user-reported bugs, errors, failed transactions).
  2. Capture everything for those sessions: When you do record, get the complete picture: full traces, request/response payloads (internal and external), frontend context, all automatically correlated.
  3. Store it together: Don't split data across observability (traces), logging (payloads), RUM (frontend), and external dashboards (Stripe). Keep it in one place, already linked.

This is what a tool like Multiplayer, with its session-based recording, is built for:

  • Cost: $ (low: only recording specific sessions, not all traffic)
  • Traces & logs: ✅ Unsampled per session
  • Internal payloads: ✅ Full content & headers out of the box
  • External APIs: ✅ Full content & headers (Stripe, Twilio, AWS), out of the box
  • Frontend: ✅ Full capture (visual replay, console, network, device info)

  • Correlation: ✅ Automatic (everything linked by session)


The benefit for humans and AI tools

Having auto-correlated, full-stack data matters even more when you're using AI to debug.

AI tools and agents are only as good as the data you give them. When you paste a stack trace and some logs into Copilot or Claude, the AI pattern-matches against common issues. It's guessing based on incomplete information.

But if you can give the AI:

  • The full user journey (what they clicked, in order)
  • The exact request the frontend sent
  • What your backend received and how it processed it
  • What you sent to Stripe and what Stripe returned
  • Where in that chain it broke

...now the AI can provide specific diagnosis based on what actually happened in your system.

Getting that complete picture with traditional observability, even unsampled, is far more costly, time-consuming, and effort-intensive. You'd need to manually gather fragments of information from multiple data silos and tools and then paste them all in.

The path forward

If you're an engineering leader evaluating observability tools, ask these questions:

  1. What does it cost to have complete, unsampled data for the requests I care about?
    • If the answer involves "collect everything 24/7," the cost will be prohibitive at scale.
  2. Can this tool capture request/response payloads for internal services without custom instrumentation?
    • If no, you're signing up for ongoing instrumentation work and incomplete coverage.
  3. Can this tool automatically capture what I send to and receive from external APIs (Stripe, Twilio, AWS)?
    • If no, external services remain black boxes unless you manually log every call.
  4. Does this tool automatically correlate frontend user sessions with backend behavior?
    • If no, you're manually stitching data together during incidents.

The goal isn't more data. It's the right data, captured automatically, already correlated, available when you need it. Whether you're debugging manually or using AI to help.


GETTING STARTED WITH MULTIPLAYER

👀 If this is the first time you’ve heard about Multiplayer, you may want to see full stack session recordings in action. You can do that in our free sandbox: sandbox.multiplayer.app

If you’re ready to trial Multiplayer you can start a free plan at any time 👇

Start a free plan
]]>
<![CDATA[With Spec-Driven Development we've come full circle to Big Design Up Front]]>https://www.multiplayer.app/blog/with-spec-driven-development-weve-come-full-circle-to-big-design-up-front/69949f53c053555359b7475bTue, 17 Feb 2026 18:09:17 GMT

When I first read GitHub’s article about spec-driven development, my first impression was: “Wait a minute. I’ve heard this before…”

Here's how they describe it:

You provide a high-level description of what you’re building and why […] It’s about user journeys, experiences, and what success looks like. […] Now you get technical. In this phase, you provide the coding agent with your desired stack, architecture, and constraints, and the coding agent generates a comprehensive technical plan…

Does this sound familiar? It should.

We used to call this Big Design Up Front (BDUF). We spent decades running away from it. And now we're sprinting right back, rebranding it as innovation for the AI era. From GitHub:

[…] we’re rethinking specifications — not as static documents, but as living, executable artifacts that evolve with the project. Specs become the shared source of truth. When something doesn’t make sense, you go back to the spec; when a project grows complex, you refine it; when tasks feel too large, you break them down.

The irony is so thick you could easily represent it with a UML diagram. Documentation was always meant to be a living, evolving artifact. Engineering teams should have always started with some upfront system design before jumping into code.

But, also, implementing those best practices was harder than it sounded. Documentation was outdated the moment you pushed your next commit, writing it well was a days-long slog, and no amount of good intentions survived the pressure to ship new features.

So what’s changed? The biggest difference between BDUF and spec-driven development is that AI changes the economics of doing it well.

For the first time, writing a thorough, living specification isn't a weeks-long documentation marathon. It's a few hours (minutes?) of conversation.

That’s why teams can and should adopt it now. Spec-driven development with AI is genuinely better than BDUF and it will make your team more productive.

A quick trip through history

To understand why this matters, it's worth remembering what we're comparing it to.

In the waterfall days, BDUF was gospel. You'd spend months perfecting every requirement, every architecture diagram, every interface specification before writing a single line of code. The theory was sound: think deeply before building, catch problems early, avoid expensive rewrites.

The reality was brutal. Requirements changed while you were still in month three of design reviews. Technologies evolved. Markets shifted. By the time you finished your perfect blueprint, it was already obsolete. And when implementation finally started, reality had an annoying habit of disagreeing with your theoretical perfection.

Then the Agile Manifesto arrived and we all collectively exhaled. Finally. No more soul-crushing design documents that nobody reads.

But here's what happened: a lot of teams heard "Responding to change over following a plan," and translated it to "don't plan at all." They took "working software over comprehensive documentation" to mean "no documentation."

The pendulum swung from BDUF paralysis to "we'll figure it out as we go" chaos.

I've written about why this is a problem before. Modern distributed systems are too complex to hold in your head. Without some upfront design, you end up with:

  • Teams working from different mental models of the same system
  • Architecture decisions made by accident rather than intention
  • Critical trade-offs (security, scalability, cost) discovered too late
  • Knowledge that lives in someone's head until they leave

The answer isn't BDUF. But it's also not "no design." It’s somewhere in the middle: enough upfront design to establish shared vision, identify significant risks, and make conscious trade-offs, without the paralysis of trying to specify everything in advance.

We always knew this. We just struggled to do it consistently, because it takes discipline, the benefits feel distant, and there's always pressure to just start coding.

AI tools are changing that calculus.

Why AI agents need structure and how they make it easier

Here's the thing about AI coding tools and agents: they need specificity and structure.

Why? Because they lack the capabilities that let human developers work with ambiguity:

  • They lack context beyond the prompt. When you tell a senior engineer "build an authentication system," they know you're already using Auth0 for your customer portal, and follow the same pattern for consistency. An AI agent? It might give you a perfectly functional JWT implementation from scratch, or suggest Passport.js, or recommend Firebase Auth. All reasonable choices in isolation, but potentially creating fragmentation in your architecture. They don't have access to your team's tribal knowledge, past design decisions, or the unwritten "we standardized on XYZ" agreements.
  • They don't think defensively or double-check edge cases. A human developer building a user registration endpoint will pause and think: "Wait, what if someone registers with the same email twice? What if the email service is down? What about internationalized characters in names?". They'll ping you on Slack if they're unsure. An AI agent will confidently write the happy path and keep moving.
  • They struggle with ambiguity. Tell a human developer "make it performant" and they'll ask clarifying questions or make reasonable assumptions based on the application type. Tell an AI agent "make it performant" and you might get... literally anything. But write "Ensure 95th percentile response time stays under 200ms" and suddenly they have something concrete to optimize for.

That’s why GitHub created Spec Kit, a toolkit that formalizes a four-phase process: Specify (user journeys, experiences, outcomes) → Plan (stack, architecture, constraints) → Tasks (small, reviewable chunks) → Implement (AI generates code).

But here's what's different from BDUF: you don't have to write all of this yourself, from scratch, in a vacuum. The process is collaborative and iterative, with AI doing the heavy lifting at every stage.

What this actually looks like in practice

At each stage of this iterative conversation with AI tools, the humans are doing the high-value work: making judgment calls, catching misalignments, applying context the AI doesn't have. The AI is doing the drafting, the structuring, the fleshing-out.

What used to take days of writing and review meetings now takes hours.

The barrier to actually doing upfront design properly (the time cost, the documentation overhead, the discipline required) drops dramatically.

"We need to do this for the AI tools to work well" means that engineering teams finally:

  • Write clear specifications before coding
  • Document architectural decisions while they're fresh (not six months later when someone asks "why did we build it this way?")
  • Break down work into reviewable, testable chunks
  • Maintain living documentation that evolves with the system (actually keeping it updated as things change)

The humans get better collaboration, clearer communication, and knowledge that survives beyond individual team members. The AI gets the context it needs to generate useful code. Future developers (human or AI) inherit comprehensible systems instead of archaeological dig sites.

Where this really pays off: parallel AI agents

The single most compelling argument for spec-driven development is what happens when you start running multiple AI coding agents in parallel.

This is where structured upfront design becomes essential: when you have several agents working simultaneously on different parts of a system (e.g. one building the API layer, another handling the data model, another working on the frontend integration) the spec and the task breakdown become the coordination mechanism. They're how the agents know what they're each responsible for, what interfaces they can rely on from each other, and what the boundaries of their work are.

Without that shared source of truth, parallel agents will do what parallel human teams do without proper coordination: make inconsistent assumptions, build to incompatible interfaces, duplicate work, and create integration problems that are expensive to untangle later.

With a well-structured spec and clearly decomposed tasks, each agent has exactly the context it needs. It knows the overall architecture. It knows the constraints. It knows what it owns and what it depends on. The result is faster development that actually fits together.

Full circle, but wiser

So yes, there's a family resemblance between spec-driven development and BDUF. Both involve designing before you build. Both emphasize clear requirements and structured planning.

But the differences matter. BDUF was slow, manual, expensive, and brittle. Spec-driven development with AI is fast, iterative, collaborative, and maintainable.

More importantly, it's actually achievable.

The reason teams skipped upfront design wasn't that they didn't believe in it, it was because the cost was too high relative to the short-term pressure to just start coding. AI changes that equation. When a solid spec takes hours instead of weeks, the discipline becomes sustainable.

We've come full circle, but we've brought better tools.

One more thing AI agents need

Better specs and clearer task decomposition are a significant step forward. But specs and plans describe intentions. What AI agents also need (especially for debugging, iteration, and understanding production behavior) is visibility into what systems actually do at runtime.

Your AI coding assistant can't debug a production issue if your observability stack only shows sampled traces. It can't understand a user-reported bug if you can't connect frontend errors to backend failures. It can't suggest meaningful fixes if it can't see the complete context of what actually happened.

The same trend that's pushing us toward better design practices upfront should also push us toward better observability practices in production. AI agents are revealing gaps in both, and closing those gaps is what will make AI-assisted development genuinely powerful, end to end.


GETTING STARTED WITH MULTIPLAYER

👀 If this is the first time you’ve heard about Multiplayer, you may want to see full stack session recordings in action. You can do that in our free sandbox: sandbox.multiplayer.app

If you’re ready to trial Multiplayer you can start a free plan at any time 👇

Start a free plan
]]>
<![CDATA[Why AI can't debug your API integrations (yet)]]>https://www.multiplayer.app/blog/why-ai-cant-debug-your-api-integrations-yet/69724c10cd2ec7331dcb55c6Thu, 22 Jan 2026 17:20:44 GMT

AI coding assistants have transformed how we write code. For example, GitHub Copilot, Cursor, and ChatGPT can generate Stripe integration boilerplate in seconds. They'll scaffold your payment flow, suggest error handling patterns, and even write unit tests.

But when your Stripe integration breaks in production can AI actually help you debug it? The honest answer: not really. At least, not yet.

Here’s why.

The limitation: AI needs context it can't get on its own

When you ask an AI assistant "why is my Stripe payment failing?", it responds with educated guesses based on common patterns:

  • "Check if the card is expired"
  • "Verify you're using the correct currency format"
  • "Ensure you're handling insufficient funds errors"
  • "Confirm your API keys are valid"

These are all reasonable suggestions. They're based on what usually causes Stripe payment failures across the thousands of codebases the AI was trained on. But the AI doesn't know what actually happened in your specific case. It can't tell you:

  • What payload did your frontend send to your backend?
  • What request did your backend construct and send to Stripe?
  • What response did Stripe return?
  • How did your backend process that response?
  • What error (if any) made it back to the user?

Without this runtime context, the AI is pattern-matching. It's giving you a troubleshooting checklist, not a diagnosis.

The problem: getting context for external APIs is effort-intensive

The irony is that the data AI needs often exists, it's just scattered and difficult to access.

When a Stripe integration breaks, you need to see the complete request/response exchange: what you sent them, what they returned, and how your system handled it. This is where traditional debugging approaches fall short, particularly for external API calls.

APM tools show that you made the call and how long it took, but not the payload exchange. Most Application Performance Monitoring platforms (Datadog, New Relic, Dynatrace) can track that your backend called stripe.charges.create() and that it took 340ms. They might even show it returned a 400 error. But they typically don't capture the full request body you sent or the detailed error response Stripe returned. At least not by default.

In theory, APM tools CAN capture this data IF properly instrumented. You can configure custom spans, add metadata attributes, and enrich traces with payload information. But this requires:

  • Configuration complexity: Custom instrumentation for each external API integration
  • Cost considerations: Full payload capture dramatically increases data volumes and APM bills
  • Intentional redaction: Many teams deliberately avoid logging payment data due to PCI compliance requirements
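For illustration, that custom instrumentation might look like this OpenTelemetry sketch (the attribute names are our own, and it runs straight into the volume and PCI concerns above):

import json

import stripe
from opentelemetry import trace

tracer = trace.get_tracer(__name__)

def create_charge(stripe_data):
    with tracer.start_as_current_span("stripe.charge.create") as span:
        # Auto-instrumentation alone records only method, URL, duration,
        # and status; the payloads have to be attached by hand.
        span.set_attribute("stripe.request.body", json.dumps(stripe_data))
        response = stripe.Charge.create(**stripe_data)
        span.set_attribute("stripe.response.body", json.dumps(response.to_dict()))
        return response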

The result? When debugging external API failures, most teams end up manually gathering context from multiple sources:

  1. Check your application logs (CloudWatch, Splunk) for what you sent
  2. Check Stripe's dashboard for their logs of the request
  3. Check your error monitoring (Sentry, Rollbar) for the exception
  4. Check your frontend session replay to see what the user experienced
  5. Manually correlate timestamps, request IDs, and user sessions across all of these

Once you've spent 30-60 minutes gathering this fragmented context, you can paste it into an AI assistant and ask for help. But at that point, you've already done most of the debugging work yourself.

A real example: the AMEX one-time code bug

Let's walk through a concrete scenario to see where AI debugging breaks down.

The Problem:

After deploying new payment features, customers complain they don't receive one-time authentication codes on their phones when paying with American Express cards. The issue is intermittent and doesn't affect Visa or Mastercard. The engineering team suspects an authentication bug but can't reproduce it reliably.

Traditional Debugging Workflow

Step 1: Check error monitoring

Sentry shows some frontend timeout errors during checkout, but no clear stack trace pointing to the root cause. The errors are generic: "Request timeout after 30s."

Step 2: Check application logs

Search CloudWatch for logs around the time customers reported issues. Find log entries showing successful calls to Stripe's API, but the logs don't include full request payloads (they were redacted for PCI compliance).

Step 3: Check Stripe's dashboard

Log into Stripe's dashboard and search for the affected transactions by timestamp and customer email. Finally discover that some requests are receiving authentication_required responses, but your system isn't handling them correctly.

Step 4: Reproduce locally

Try to reproduce the issue in staging with test AMEX cards. It doesn't happen consistently. Realize you need to see the actual production payloads to understand the pattern.

Step 5: Add more logging and wait

Deploy additional logging to capture more details about AMEX transactions. Wait for the issue to occur again.

Total time to diagnosis: Hours to days, depending on how quickly the issue reproduces.

At this point, could you ask AI for help? You could paste your fragmented logs and ask "why might Stripe authentication fail for AMEX?". The AI would suggest checking 3DS configuration, webhook handling, and card type compatibility. All reasonable but generic advice.

Full stack session recording with auto-correlated API data

Now imagine a different debugging workflow:

  1. A customer reports the issue
  2. You pull up the full-stack replay of their session
  3. You see:
  • The exact checkout form they filled out (frontend)
  • The API request your backend constructed (POST /v1/payment_intents)
  • The full payload sent to Stripe, including the discovery that card numbers for AMEX are getting an extra digit appended (aha! that’s the issue!)
  • Stripe's response: invalid_card_number
  • How your backend handled this response (incorrectly treating it as a generic timeout)
  • What the user saw (spinning loader with no error message)

Total time to diagnosis: 5-10 minutes.

Now when you ask AI for help, you can provide the complete context: "Here's the exact payload we sent to Stripe for AMEX cards. Stripe returned invalid_card_number. Our code is adding an extra digit. Why?"

The AI can now give you a specific answer: "Your string concatenation logic in formatCardNumber() is applying AMEX-specific formatting twice. Here's the fix..." Instead of guessing at possibilities, it's debugging actual runtime behavior.
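As a purely hypothetical reconstruction of that bug class (the real formatCardNumber logic would differ, and the "derived digit" here is fake), the failure mode is a non-idempotent, brand-specific step that runs twice in the same request path:

def format_card_number(number: str, brand: str) -> str:
    digits = "".join(ch for ch in number if ch.isdigit())
    if brand == "amex":
        # Non-idempotent step: appends a derived digit every time it runs.
        digits += str(sum(int(d) for d in digits) % 10)
    return digits

# Intended: one application yields a 15-digit AMEX number.
once = format_card_number("3782 8224 6310 00", "amex")
# The bug: a second application in the API client appends one more digit,
# so Stripe receives 16 digits and returns invalid_card_number.
twice = format_card_number(once, "amex")
assert (len(once), len(twice)) == (15, 16)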


The future: auto-correlation makes AI debugging actually useful

The next generation of debugging doesn't depend exclusively on the quality of AI models; it depends heavily on feeding AI tools the context they need to be useful.

Auto-correlation tools like Multiplayer automatically capture and link data across your entire stack: frontend interactions, backend traces and logs, and end-to-end request/response headers and content from internal service and external API calls. This data becomes the foundation for effective AI-assisted debugging.

When AI has access to:

  • What the user actually did (not what you think they did)
  • What data your system actually sent (not what it should send)
  • What external APIs actually returned (not what the docs say they return)
  • How your code actually processed the response (not what you intended)

... then AI can shift from suggesting possibilities to diagnosing realities.

This is why auto-correlation and AI coding assistants are complementary, not competing technologies. Correlation tools provide the runtime context that transforms AI from a pattern-matcher into a debugger.


GETTING STARTED WITH MULTIPLAYER

👀 If this is the first time you’ve heard about Multiplayer, you may want to see full stack session recordings in action. You can do that in our free sandbox: sandbox.multiplayer.app

If you’re ready to trial Multiplayer you can start a free plan at any time 👇

Start a free plan
]]>
<![CDATA[Multiplayer 2025: year in review]]>https://www.multiplayer.app/blog/multiplayer-2025-year-in-review/69405b21d48d80bc479140a3Mon, 15 Dec 2025 20:10:20 GMT

2025 was a defining year for Multiplayer.

We focused on a simple but ambitious goal: making debugging faster, less fragmented and less manual. That meant meeting developers where they were already working and capturing the right context at the right time.

Across the year, Multiplayer evolved from a powerful session recording tool into a full workflow for understanding what actually happens in production - from the user’s screen all the way to deep backend calls - and acting on that knowledge faster.

Here’s a look back at what we shipped in 2025, and where we’re headed next.

Full stack session recordings

At the core of Multiplayer is the idea that debugging starts with context. This year, we significantly expanded how that context is captured, shared, and used.

  • Multiple recording modes: You’re in control of what gets captured and when. Record on-demand when users hit issues, or run continuous recording in the background to automatically capture issues and exceptions. No more "I wish we had been recording when that happened."
  • Annotations: Sketch directly on recordings and annotate any data point collected, from individual timestamps to clicks, traces, and spans. Your team has all the context they need to understand exactly where the UI broke or which log entry needs investigation.
  • Mobile support: We've shipped React Native support, bringing the same full stack recording capabilities you have on web to your mobile applications. Whether you're troubleshooting a checkout flow on iOS or diagnosing API failures on Android, you get the complete picture.
Example full stack session recording with annotations and a sketch

AI-powered workflows

AI tools are only as useful as the context you give them. In 2025, we focused on making Multiplayer a high-quality data source for AI-assisted debugging.

  • MCP server: brings full stack session recordings into MCP-compatible AI tools like Cursor, Claude Code, Copilot, Windsurf, and Zed. Instead of feeding your IDE partial context, you give it everything: frontend replays, user actions, backend traces, logs, request/response payloads, and your team's annotations.
  • VS Code extension: Your full stack session recordings are now available directly in your editor: pull up any recording, review frontend screens, backend traces, logs, request/response content and headers, and jump to the exact line of code where an error occurred.
A full stack session recording in VS Code

Full cycle debugging

Debugging doesn’t end when the issue is fixed. It’s also about learning, documenting, and preventing regressions.

  • Notebooks: An interactive sandbox for designing, debugging and documenting real-world API integrations. You can also automatically generate test scripts from your full stack session recordings to verify fixes, document real behavior, and prevent regressions.
  • System architecture auto-documentation: Automatically map all your components, dependencies, and APIs and visualize your application's structure. No more outdated, manually-drawn architecture diagrams - your system map stays current as your system evolves.
Example Notebook from sandbox

Better onboarding & resources

We also invested heavily in making Multiplayer easier to adopt and understand.

Example system map from sandbox

What's next: 2026 roadmap


The features below are currently in private beta with design partners and enterprise customers, and are planned for GA in early 2026.

Multiplayer AI agent

Automatically receive suggested fixes and pull requests based on issues identified in session recordings.

Today, teams manually feed recordings into AI tools. With the Multiplayer AI agent, this workflow becomes automated: instead of alerts, developers receive actionable PR suggestions grounded in real production context.

Conditional recording mode

Automatically record sessions for specific users or conditions, without manual start/stop and without “always-on recording” overhead.

This allows teams to capture issues even when users don’t report them, eliminating incomplete tickets and giving engineers immediate, actionable context.

Issue tracking

A unified view of errors, exceptions, and performance issues across frontend and backend, all linked to the sessions where they occurred.

User tracking

View all active users and remotely trigger recording conditions during live sessions. Ideal for testing, debugging, and high-touch support scenarios.

Slack integration

Get notified when recordings are created and share session links directly in Slack, keeping context close to where conversations already happen.

Issues detected with Multiplayer

Thank you!


None of this would be possible without the teams who trusted Multiplayer in production, shared feedback candidly, and pushed us to build something better.

Thank you to our users, design partners, community members, and everyone who challenged us to think deeper about debugging, support, and how engineers actually work.

We’re excited for what’s ahead in 2026 and grateful to be building it together. 💜


GETTING STARTED WITH MULTIPLAYER

👀 If this is the first time you’ve heard about Multiplayer, you may want to see full stack session recordings in action. You can do that in our free sandbox: sandbox.multiplayer.app

If you’re ready to trial Multiplayer you can start a free plan at any time 👇

Start a free plan
]]>
<![CDATA[Multiplayer vs LogRocket: which session replay tool actually fixes bugs?]]>https://www.multiplayer.app/blog/multiplayer-vs-logrocket-which-session-replay-tool-actually-fixes-bugs/69450ae96f6700df5ce2f3efThu, 11 Dec 2025 09:21:00 GMT

You're using LogRocket and, for "full stack" visibility, you also integrate it with Datadog. A user reports a checkout error. You open the session replay, see the frontend flow, and follow the link to the backend integration... but the trace is sampled out. You see some backend data, but you're missing the actual request payload that failed. Now you're back in Datadog, manually searching for the right trace, trying to find the request content, and piecing together what actually broke.

This is the gap between frontend analytics tools with backend integrations and true full-stack debugging tools.

TL;DR


Choose Multiplayer if: You need to resolve technical issues fast, with complete, unsampled, frontend + backend context in one place, and you need to improve your debugging workflows across multiple teams (e.g. Support → Engineering).

Choose LogRocket if: You primarily need user behavior analytics and frontend monitoring.

Key difference: LogRocket captures frontend behavior with optional sampled backend data through third-party integrations. Multiplayer captures complete, unsampled full-stack sessions (frontend and backend) out of the box, with no integrations required.

Quick comparison


Capability | Multiplayer | LogRocket
Primary use case | Debug technical issues with full-stack context | Product analytics and frontend monitoring
Data captured | Frontend + backend traces, logs, requests/responses (unsampled) | Frontend by default, sampled backend via integrations
Recording control | Multiple recording modes (on-demand, continuous, conditional) | "Always-on" recording
Installation | Browser extension, widget, SDK | SDK only
Collaboration | View, share, and annotate replays | View and share replays
Backend visibility | Native, unsampled, any observability platform | Requires third-party integration, sampled data
AI-native | Feed complete context to your IDE or AI tool of choice via MCP server | Interrogate and summarize session replays with native AI tool

The real difference: frontend + integrations vs native full-stack


LogRocket: frontend-first with partial backend visibility

LogRocket captures comprehensive frontend data: clicks, page loads, console logs, network requests. For product analytics and UX monitoring, this works well. You can also integrate with APM tools like Datadog or New Relic to link out to some backend data.

But here's the catch: the backend data is sampled, and critical debugging information is still missing. Even with integrations configured, you don't get:

  • Complete, unsampled logs and traces (APM sampling means you might miss the exact data you need)
  • Request/response content and headers from internal service calls

Not to mention that the backend data still lives in a separate tool. When debugging a production issue, you're forced to:

  • Search LogRocket session replays for the frontend behavior
  • Follow a link to switch to your APM tool for backend data (hoping it wasn’t sampled out)
  • Manually correlate timestamps between systems
  • Still miss critical data like full request/response content and headers from internal services

Multiplayer: complete full-stack context by default

Multiplayer captures full-stack session recordings natively, with zero sampling. Every frontend action is automatically correlated with complete backend traces, logs, and request/response data, in a single timeline.

When that checkout error happens, you see:

  • The user's click
  • The API request
  • The unsampled backend trace showing which service failed
  • The exact error message and stack trace
  • Request/response content and headers from internal service calls

No sampling gaps. No tool switching. No missing data.

By leveraging OpenTelemetry, Multiplayer works with any observability platform, with no vendor lock-in and no need for additional tools.

Recording control: always-on vs choose-your-adventure


LogRocket: Always-on recording via SDK

LogRocket uses always-on recording through its SDK: you're recording and storing everything, whether you need it or not. This works for aggregate analytics, but creates friction for debugging:

  • No granular control over when and which sessions to capture
  • Can't easily capture specific user cohorts or error scenarios
  • Limited to SDK installation (no browser extensions or widgets for end-users)

Multiplayer: record what you need, when you need it

Multiplayer offers three recording modes and three installation methods. It’s a choose-your-own-adventure approach that adapts to your teams’ workflows.

Recording modes:

  • On-demand: Start/stop recording manually. Perfect for reproducing specific bugs.
  • Continuous: Start/stop recording in the background, for your entire working session. Great for development and QA to automatically save sessions with errors and exceptions.
  • Conditional: Silent capture of specific user cohorts or error conditions.

Installation methods:

  • In-app widget: Let users report issues with replays attached automatically, directly from your app
  • Browser extension: Quickly capture a bug, unexpected behavior, or new feature idea
  • SDK / CLI Apps: Full integration for programmatic control

Real scenario: Your support team gets a vague bug description. They ask the end-user to record a full stack session replay through the in-app widget. Support is able to fully understand the problem and they can reproduce the issue in 30 seconds. It’s immediately clear what the next steps (or possible fixes) are.

Support workflows: serial handoffs vs parallel collaboration


LogRocket: Built for analytics, adapted for debugging

LogRocket's collaboration features focus on sharing and reviewing sessions:

  • Share session links
  • View frontend behavior
  • Check integrated backend data (when available and not sampled out)

But for technical debugging, you're still doing serial handoffs:

  1. Support searches and watches the replay (frontend only)
  2. Support checks for backend data (might be sampled out)
  3. Support escalates to Engineering with partial context
  4. Engineering opens APM tool to find complete traces
  5. Engineering searches for request/response content
  6. Multiple rounds of back-and-forth to gather full context

Multiplayer: complete context from the start

Multiplayer is built for parallel Support ↔ Engineering workflows:

Single, sharable timeline:

  • Frontend screens, user actions, backend traces, logs, request/response data, and user feedback, all correlated automatically
  • Support sees the user's experience; Engineering sees the technical root cause
  • Both work from the same data, at the same time

Annotations and collaboration:

  • Sketch directly on recordings
  • Annotate any data point in the timeline
  • Create interactive sandboxes for API integration
  • Link sessions directly to Zendesk, Intercom, or Jira tickets

Real scenario: Support receives a bug report via the in-app widget (with replay automatically attached). They open it, see the user's error, scroll down to see the backend trace showing a 500 error from the auth service, view the exact request that failed, annotate the failing request, and share with the backend team, all in 60 seconds. The backend team has complete context and starts fixing the issue immediately.

What you actually get per session


LogRocket captures:

  • User clicks ✓
  • Page navigations ✓
  • DOM events ✓
  • Console messages ✓
  • Network requests ✓
  • Backend traces (link to another tool, sampled data) ✓

Multiplayer captures:

Everything LogRocket captures, plus:

  • Correlated backend logs and traces (any observability platform, unsampled) ✓
  • Backend errors ✓
  • Full request/response content and headers (including from internal service calls) ✓
  • User feedback integrated in the timeline ✓
  • Service and dependency maps ✓

Integration and deployment: flexibility matters


LogRocket:

  • SDK installation only
  • Requires third-party APM integration for backend data (additional vendor, additional setup)
  • Proprietary AI agent working with partial context and no support for AI coding workflows beyond their platform

Multiplayer:

  • Multiple installation methods (extension, widget, SDK)
  • Works with any observability platform, language, framework, architecture
  • MCP server for AI-native debugging in your IDE or AI assistant

For AI-forward teams: LogRocket's proprietary AI works only within their platform and has limited context. Multiplayer's MCP server feeds complete session context (frontend + unsampled backend + annotations + full request/response data) directly to Claude, Cursor, or your AI tool of choice. Ask "why did this checkout fail?" and get answers grounded in complete, unsampled session data.

Which tool should you choose


Choose Multiplayer if:

  • You need to fix bugs and resolve technical issues fast
  • You want complete, unsampled backend visibility without integration complexity
  • Your support team regularly escalates issues to engineering
  • You need full request/response content from internal services and middleware
  • You want flexible recording modes (not just always-on)
  • You want AI-native debugging workflows with complete context

Choose LogRocket if:

  • Your primary goal is product analytics and frontend monitoring
  • PM and product teams are your main users
  • You're comfortable with sampled backend data and managing APM integrations
  • Always-on, frontend-focused recording meets your needs

Consider both if:

  • You're a large organization where user analytics and technical debugging are handled by separate teams with separate objectives

The bottom line


LogRocket is a solid user analytics platform with frontend monitoring capabilities. The APM integrations add some backend visibility, but you're still working with sampled data, missing critical information, and switching between tools to piece together what happened.

Multiplayer gives you the complete picture: frontend and backend, unsampled traces, full request/response content, all correlated automatically in a single timeline. It's session replay designed for the reality of debugging modern distributed systems, where you need complete technical context to fix issues fast.


GETTING STARTED WITH MULTIPLAYER

👀 If this is the first time you’ve heard about Multiplayer, you may want to see full stack session recordings in action. You can do that in our free sandbox: sandbox.multiplayer.app

If you’re ready to trial Multiplayer you can start a free plan at any time 👇

Start a free plan
]]>
<![CDATA[Multiplayer vs Mixpanel: which session replay tool actually fixes bugs?]]>https://www.multiplayer.app/blog/multiplayer-vs-mixpanel-which-session-replay-tool-actually-fixes-bugs/693aa89cd48d80bc4791407fWed, 10 Dec 2025 12:21:00 GMT

You've got a critical bug report. A user can't complete their purchase at checkout. You open Mixpanel, navigate to session replay, watch them click through the checkout flow... and then they get stuck. The frontend looks fine, but something's clearly broken. What failed on the backend? Was it a payment service timeout? A validation error?

Now you're digging through logs, checking APM dashboards, correlating timestamps, and trying to piece together what happened on the backend.

This is the gap between product analytics platforms with session replay features and purpose-built debugging tools.

TL;DR


Choose Multiplayer if: You need to resolve technical issues fast, with complete frontend + backend context in one place, and you need to improve your debugging workflows across multiple teams (e.g. Support → Engineering).

Choose Mixpanel if: You primarily need product analytics with session replay as a supplementary feature for understanding user behavior.

Key difference: Mixpanel shows you how users behave on your frontend, aggregating website performance metrics. Multiplayer shows how your system behaves, from user actions to backend traces, and how to fix a bug (or have your AI coding assistant do it for you).

Quick comparison


Capability | Multiplayer | Mixpanel
Primary use case | Debug technical issues with full-stack context | Product analytics with session replay
Data captured | Frontend + backend traces, logs, requests/responses | Frontend only
Recording control | Multiple recording modes (on-demand, continuous, conditional) | "Always-on" recording
Installation | Browser extension, widget, SDK | SDK only
Collaboration | View, share, and annotate replays | View and share replays
Backend visibility | Native and customizable | None
Deployment | SaaS or self-hosted | SaaS only


The real difference: frontend vs full stack


Mixpanel: product analytics platform with session replay bolted on

Mixpanel is a mature product analytics platform with core features such as event tracking, funnels, cohort analysis, and A/B testing. Session replay is an additional feature in this toolset to help understand user behavior and product metrics.

But when you need to debug a technical issue, you only get frontend data. Mixpanel captures no backend data and has no observability tool integrations, which means:

  • No visibility into API calls beyond the browser
  • No distributed traces showing which services were involved
  • No request/response content from your backend services
  • No console messages or HTML source code

When debugging a production issue, you're forced to:

  • Search through Mixpanel's session replays (frontend only)
  • Switch to your observability platform and hunt through logs to find the right data
  • Manually correlate timestamps across systems
  • Piece together what happened without a unified view

Multiplayer: purpose-built for debugging

Multiplayer captures full stack session recordings by default. Every frontend action is automatically correlated with backend traces, logs, and request/response data, in a single, unified timeline.

When the checkout button fails, you see:

  • The user's click
  • The API request
  • The backend trace showing which service failed
  • The exact error message and stack trace
  • Request/response content and headers from internal service calls

No hunting. No manual correlation. No tool switching. Everything you need to fix the bug is in one place.

Recording control: always-on vs choose-your-adventure


Mixpanel: Always-on recording via SDK

Mixpanel uses always-on recording through its SDK: you're recording and storing everything, whether you need it or not. This works for aggregate analytics, but creates friction for debugging:

  • No granular control over when and which sessions to capture
  • Can't easily capture specific user cohorts or error scenarios
  • Limited to SDK installation (no browser extensions or widgets for end-users)

Multiplayer: record what you need, when you need it

Multiplayer offers three recording modes and three installation methods. It’s a choose-your-own-adventure approach that adapts to your teams’ workflows.

Recording modes:

  • On-demand: Start/stop recording manually. Perfect for reproducing specific bugs.
  • Continuous: Start/stop recording in the background during your entire working session. Great for development and QA to automatically save sessions with errors and exceptions.
  • Conditional: Silent capture of specific user cohorts or error conditions.

Installation methods:

  • In-app widget: Let users report issues with replays attached automatically, directly from your app
  • Browser extension: Quickly capture a bug, unexpected behavior, or new feature idea
  • SDK / CLI Apps: Full integration for programmatic control

Real scenario: Your support team gets a vague bug description. They ask the end-user to record a full stack session replay through the in-app widget. Support can fully understand the problem and reproduce the issue in 30 seconds. It’s immediately clear what the next steps (or possible fixes) are.

Support workflows: serial handoffs vs parallel collaboration


Mixpanel: Built for product teams, not support workflows

Mixpanel's workflow is designed for product analytics:

  • Track events and user properties
  • Analyze funnels and retention
  • View session replays as supplementary context
  • Share reports and dashboards

For technical debugging, you're doing manual work:

  1. Support searches and watches a session replay in Mixpanel
  2. Support escalates to Engineering with partial context
  3. Engineering opens observability tools to find backend data
  4. Engineering searches for the right logs and traces
  5. Multiple rounds of back-and-forth to gather full context

Multiplayer: complete context from the start

Multiplayer is built for parallel Support ↔ Engineering workflows:

Single, sharable timeline:

  • Frontend screens, user actions, backend traces, logs, request/response data, and user feedback, all correlated automatically
  • Support sees the user's experience; Engineering sees the technical root cause
  • Both work from the same data, at the same time

Annotations and collaboration:

  • Sketch directly on recordings
  • Annotate any data point in the timeline
  • Create interactive sandboxes for API integrations
  • Link sessions directly to Zendesk, Intercom, or Jira tickets

What you actually get per session


Mixpanel captures:

  • User clicks ✓
  • Page navigations ✓
  • DOM events ✓
  • Network requests ✓

Multiplayer captures:

Everything Mixpanel captures, plus:

  • Console messages ✓
  • HTML source code ✓
  • Correlated backend logs and traces (any observability platform, unsampled) ✓
  • Backend errors ✓
  • Full request/response content and headers (including from internal service calls) ✓
  • User feedback integrated in the timeline ✓
  • Service and dependency maps ✓

Integration and deployment: flexibility matters


Mixpanel:

  • SDK installation only
  • SaaS deployment only
  • MCP server for interrogating product data (not debugging)

Multiplayer:

  • Multiple installation methods (extension, widget, SDK)
  • SaaS or self-hosted deployment
  • Works with any observability platform, language, framework, architecture
  • MCP server feeds complete context to your IDE or AI tool

For teams with compliance requirements: Mixpanel's SaaS-only model can be a dealbreaker. Multiplayer's self-hosted option keeps sensitive data in your infrastructure.

For AI-forward teams: Mixpanel's MCP server is optimized for product data analysis: understanding user behavior and product metrics. Multiplayer's MCP server feeds complete debugging context (frontend + unsampled backend + annotations + full request/response data) directly to Claude, Cursor, or your AI tool of choice. Ask "why did this checkout fail?" and get answers grounded in complete session data, not just frontend clicks.

Which tool should you choose?


Choose Multiplayer if:

  • You need to fix bugs and resolve technical issues fast
  • You want complete backend visibility alongside frontend data
  • Your support team regularly escalates issues to engineering
  • You need full request/response content from internal services
  • You want flexible recording modes and installation options
  • You have compliance requirements that need self-hosting
  • You want AI-native debugging workflows with complete context

Choose Mixpanel if:

  • Your primary goal is product analytics (funnels, cohorts, retention, A/B testing)
  • Product and UX teams are your main users
  • Session replay is a supplementary feature for understanding user behavior
  • You don't need backend debugging data
  • You're comfortable managing separate tools for analytics and debugging

Consider both if:

  • You're a large organization where product analytics and technical debugging are handled by separate teams with separate objectives

The bottom line


Mixpanel is a powerful product analytics platform with comprehensive event tracking and analysis capabilities. Session replay is an add-on feature designed for understanding user behavior, not for debugging technical issues across your full stack.

Multiplayer is purpose-built for debugging. Full-stack session recordings give you frontend and backend context, automatically correlated in a single timeline. It's session replay designed for the reality of modern distributed systems, where you need complete technical context to fix issues fast.


GETTING STARTED WITH MULTIPLAYER

👀 If this is the first time you’ve heard about Multiplayer, you may want to see full stack session recordings in action. You can do that in our free sandbox: sandbox.multiplayer.app

If you’re ready to trial Multiplayer you can start a free plan at any time 👇

Start a free plan
]]>
<![CDATA[Multiplayer vs PostHog: which session replay tool actually fixes bugs?]]>https://www.multiplayer.app/blog/multiplayer-vs-posthog-which-session-replay-tool-actually-fixes-bugs/693a98bdd48d80bc4791405bTue, 09 Dec 2025 11:15:00 GMT

You've got a bug report from a frustrated user. You open PostHog, search through all the session replays to find the right one, watch the frontend interaction, and see where they got stuck. But you can't see what failed on the backend. Was it a timeout? A validation error? A service dependency issue?

Now you're digging through logs, checking APM dashboards, correlating timestamps, and trying to piece together what happened on the backend.

This is the gap between product analytics platforms with session replay features and purpose-built debugging tools.

TL;DR


Choose Multiplayer if: You need to resolve technical issues fast, with complete frontend + backend context in one place, and you need to improve your debugging workflows across multiple teams (e.g. Support → Engineering).

Choose PostHog if: You primarily need product analytics with session replay as a supplementary feature for understanding user behavior.

Key difference: PostHog is a product analytics platform with frontend-only session replay. Multiplayer is purpose-built for debugging with full-stack session recordings, from user actions to backend traces, showing you how to fix a bug (or have your AI coding assistant do it for you).

Quick comparison


Multiplayer | PostHog
Primary use case: Debug technical issues with full-stack context | Product analytics with session replay
Data captured: Frontend + backend traces, logs, requests/responses | Frontend only
Recording control: Multiple recording modes (on-demand, continuous, conditional) | Conditional recording
Installation: Browser extension, widget, SDK | SDK only
Collaboration: View, share, and annotate replays | View and share replays
Backend visibility: Native and customizable | None
AI-native: MCP server feeds complete context to your IDE or AI tool | MCP server for interrogating product data


The real difference: product analytics vs debugging tool


PostHog: Analytics platform with session replay

PostHog is a comprehensive product analytics platform. Session replay is one feature among many (feature flags, A/B testing, surveys, product analytics). For understanding user behavior, funnel analysis, and product decisions, this works well.

But when you need to debug a technical issue, you only get frontend data. PostHog has no backend data capture or observability integrations, which means:

  • No visibility into API calls beyond the browser
  • No distributed traces showing which services were involved
  • No request/response content from your backend services

When debugging a production issue, you're forced to:

  • Search through PostHog to find the right session replay (frontend only)
  • Switch to your observability platform for backend data
  • Manually correlate timestamps across systems
  • Piece together what happened without a unified view

Multiplayer: Purpose-built for debugging

Multiplayer is focused on resolving technical issues. Full-stack session recordings capture everything you need in a single timeline:

  • The user's frontend actions
  • The API requests
  • Backend traces showing which services were called
  • Request/response content and headers from internal service calls
  • Error messages and stack traces
  • User feedback

No searching through hundreds of sessions. No tool switching. No manual correlation.

Recording control: analytics-first vs choose-your-adventure


PostHog: conditional recording

PostHog offers always-on recording (via SDK) based on conditions you can customize. This works for product analytics, where you want to capture broad user behavior, but it creates friction for debugging:

  • No granular control over when and which sessions to capture
  • Limited to SDK installation (no browser extensions or widgets for end-users)

Multiplayer: record what you need, when you need it

Multiplayer offers three recording modes and three installation methods. It’s a choose-your-own-adventure approach that adapts to your teams’ workflows.

Recording modes:

  • On-demand: Start/stop recording manually. Perfect for reproducing specific bugs.
  • Continuous: Start/stop recording in the background during your entire working session. Great for development and QA to automatically save sessions with errors and exceptions.
  • Conditional: Silent capture of specific user cohorts or error conditions.

Installation methods:

  • In-app widget: Let users report issues with replays attached automatically, directly from your app
  • Browser extension: Quickly capture a bug, unexpected behavior, or new feature idea
  • SDK / CLI Apps: Full integration for programmatic control

Real scenario: Your support team gets a vague bug description. They ask the end-user to record a full stack session replay through the in-app widget. Support can fully understand the problem and reproduce the issue in 30 seconds. It’s immediately clear what the next steps (or possible fixes) are.

Support workflows: serial handoffs vs parallel collaboration


PostHog: Built for product teams, adapted for support

PostHog's workflow is designed for product analytics:

  • Search through session replays to find the relevant one
  • Share session links with your team
  • View frontend behavior
  • Build dashboards and funnels

But for technical debugging, this creates serial handoffs:

  1. Support searches and watches the replay (frontend only)
  2. Support escalates to Engineering with partial context
  3. Engineering opens observability tools to find backend data
  4. Engineering searches for the right logs and traces
  5. Multiple rounds of back-and-forth to gather full context

Multiplayer: complete context from the start

Multiplayer is built for parallel Support ↔ Engineering workflows:

Single, sharable timeline:

  • Frontend screens, user actions, backend traces, logs, request/response data, and user feedback, all correlated automatically
  • Support sees the user's experience; Engineering sees the technical root cause
  • Both work from the same data, at the same time

Annotations and collaboration:

  • Sketch directly on recordings
  • Annotate any data point in the timeline
  • Create interactive sandboxes for API integrations
  • Link sessions directly to Zendesk, Intercom, or Jira tickets

What you actually get per session


PostHog captures:

  • User clicks ✓
  • Page navigations ✓
  • DOM events ✓
  • Console messages ✓
  • Network requests ✓
  • Backend errors (requires PostHog backend instrumentation—vendor lock-in) ✓

Multiplayer captures:

Everything PostHog captures, plus:

  • Correlated backend logs and traces (any observability platform, unsampled) ✓
  • Backend errors (no vendor lock-in) ✓
  • Full request/response content and headers (including from internal service calls) ✓
  • User feedback integrated in the timeline ✓
  • Service and dependency maps ✓

Which tool should you choose?


Choose Multiplayer if:

  • You need to fix bugs and resolve technical issues fast
  • You want complete backend visibility alongside frontend data
  • Your support team regularly escalates issues to engineering
  • You need full request/response content from internal services
  • You want flexible recording modes and installation options
  • You want AI-native debugging workflows with complete context

Choose PostHog if:

  • Your primary goal is product analytics (funnels, feature flags, A/B testing, surveys)
  • Product and UX teams are your main users
  • Session replay is a supplementary feature for understanding user behavior
  • You don't need backend debugging data
  • You're comfortable managing separate tools for analytics and debugging

Consider both if:

  • You're a large organization where product analytics and technical debugging are handled by separate teams with separate objectives

The bottom line


PostHog is a powerful product analytics platform with many valuable features. Session replay is one tool among many, designed for understanding user behavior and product performance, not for debugging technical issues across your full stack.

Multiplayer is purpose-built for debugging. Full-stack session recordings give you frontend and backend context, automatically correlated in a single timeline. It's session replay designed for the reality of modern distributed systems, where you need complete technical context to fix issues fast.

GETTING STARTED WITH MULTIPLAYER

👀 If this is the first time you’ve heard about Multiplayer, you may want to see full stack session recordings in action. You can do that in our free sandbox: sandbox.multiplayer.app

If you’re ready to trial Multiplayer you can start a free plan at any time 👇

Start a free plan
]]>
<![CDATA[Multiplayer vs Fullstory: which session replay tool actually gives you the full story?]]>https://www.multiplayer.app/blog/multiplayer-vs-fullstory-which-session-replay-tool-actually-gives-you-the-full-story/693a9290d48d80bc47914025Mon, 08 Dec 2025 11:10:00 GMT

You've got a critical bug report. A user can't complete checkout. You open Fullstory, watch the session replay, see them click the checkout button... and then what? The frontend looks fine, but something's clearly broken. Now you're digging through logs, checking APM dashboards, correlating timestamps, and trying to piece together what happened on the backend.

This is the gap between user analytics tools and debugging tools.

TL;DR


Choose Multiplayer if: You need to resolve technical issues fast, with full frontend + backend context in one place, and you need to improve your debugging workflows across multiple teams (e.g. Support → Engineering).

Choose Fullstory if: You primarily need behavioral analytics for product and UX decisions.

Key difference: Fullstory shows you how users behave on your website, aggregating performance metrics. Multiplayer shows how your system behaves, from user actions to backend traces, and how to fix a bug (or have your AI coding assistant do it for you).

Quick comparison


Multiplayer | Fullstory
Primary use case: Debug technical issues with full-stack context | Analyze user behavior and UX at scale
Data captured: Frontend + backend traces, logs, requests/responses | Frontend only
Recording control: Multiple recording modes (on-demand, continuous, conditional) | “Always-on” recording
Installation: Browser extension, widget, SDK | SDK only
Collaboration: View, share, and annotate replays | View and share replays
Backend visibility: Native and customizable | None
Deployment: SaaS or self-hosted | SaaS only


The real difference: frontend vs full stack


Fullstory (or half the story?)

Fullstory captures what happens in the browser: clicks, page loads, DOM events. For understanding user flows and UX patterns, this is valuable. But when you're debugging a technical issue, you're missing the critical half: what happened in your backend.

When an API call fails, a database query times out, or a microservice throws an error, you're forced to:

  • Switch to your observability platform
  • Manually correlate timestamps
  • Hunt through logs to find the right data
  • Piece together context across multiple tools

Multiplayer: the actual full story

Multiplayer captures full stack session recordings by default. Every frontend action is automatically correlated with backend traces, logs, and request/response data, in a single, unified timeline.

When that checkout button fails, you see:

  • The user's click
  • The API request
  • The backend trace showing which service failed
  • The exact error message and stack trace
  • Request/response content and headers from internal service calls

No hunting. No manual correlation. No tool switching. Everything you need to fix the bug is in one place.

Real scenario: A user reports "payment failed" but your logs show a 200 response. With Multiplayer, you see: the button click, the API call to your payment service, the upstream call to Stripe, the 429 rate limit error from Stripe, and the incorrectly handled error response your service returned as a 200.

Recording control: always-on vs choose-your-adventure


Fullstory: Always-on recording via SDK

Fullstory uses always-on recording through its SDK: you're recording and storing everything, whether you need it or not. This works fine for aggregate analytics, but creates friction for debugging:

  • No granular control over when and which sessions to capture
  • Can't easily capture specific user cohorts or error scenarios
  • Limited to SDK installation (no browser extensions or widgets for end-users)

Multiplayer: record what you need, when you need it

Multiplayer offers three recording modes and three installation methods. It’s a choose-your-own-adventure approach that adapts to your teams’ workflows.

Recording modes:

  • On-demand: Start/stop recording manually. Perfect for reproducing specific bugs.
  • Continuous: Start/stop recording in the background during your entire working session. Great for development and QA to automatically save sessions with errors and exceptions.
  • Conditional: Silent capture of specific user cohorts or error conditions.

Installation methods:

  • In-app widget: Let users report issues with replays attached automatically, directly from your app
  • Browser extension: Quickly capture a bug, unexpected behavior, or new feature idea
  • SDK / CLI Apps: Full integration for programmatic control

Real scenario: Your support team gets a vague bug description. They ask the end-user to record a full stack session replay through the in-app widget. Support can fully understand the problem and reproduce the issue in 30 seconds. It’s immediately clear what the next steps (or possible fixes) are.

Support workflows: serial handoffs vs parallel collaboration


Fullstory: built for analysts, not debugging teams

Fullstory's collaboration features are designed for PM and UX teams reviewing sessions asynchronously:

  • Share session links
  • Add highlights and notes
  • Build funnels and dashboards

But for technical debugging, this creates serial handoffs:

  1. Support searches and watches the replay (frontend only)
  2. Support escalates to Engineering with partial context
  3. Engineering opens observability tools to find backend data
  4. Engineering asks follow-up questions
  5. Support provides more details
  6. Repeat until enough context is gathered

Multiplayer: complete context from the start

Multiplayer is built for parallel Support ↔ Engineering workflows:

Single, sharable timeline:

  • Frontend screens, user actions, backend traces, logs, request/response data, and user feedback, all correlated automatically
  • Support sees the user's experience; Engineering sees the technical root cause
  • Both work from the same data, at the same time

Annotations and collaboration:

  • Sketch directly on recordings
  • Annotate any data point in the timeline
  • Create interactive sandboxes for API integrations
  • Link sessions directly to Zendesk, Intercom, or Jira tickets

What you actually get per session


Fullstory captures:

  • User clicks ✓
  • Page navigations ✓
  • DOM events ✓
  • Console messages (browser only) ✓
  • Network requests (paid plans only) ✓

Multiplayer captures:

Everything Fullstory captures, plus:

  • Correlated backend logs and traces (any observability platform, unsampled) ✓
  • Backend errors ✓
  • Full request/response content and headers (including from internal service calls) ✓
  • User feedback integrated in the timeline ✓
  • Service and dependency maps ✓

Integration and deployment: flexibility matters


Fullstory:

  • SDK installation only
  • SaaS deployment only
  • Mobile support is a paid add-on
  • No backend visibility or observability integrations
  • No support for AI coding workflows

Multiplayer:

  • Web and mobile support out of the box
  • Multiple installation methods (extension, widget, SDK)
  • SaaS or self-hosted deployment
  • Works with any observability platform (Datadog, New Relic, Grafana, etc.), language, framework, and architecture
  • MCP server for AI-native debugging in your IDE or AI assistant

For teams with compliance requirements: Fullstory's SaaS-only model can be a dealbreaker. Multiplayer's self-hosted option keeps sensitive data in your infrastructure.

For AI-forward teams: Multiplayer's MCP server feeds complete session context (frontend + backend + annotations) directly to Claude, Cursor, or your AI tool of choice. Ask "why did this checkout fail?" and get answers grounded in the actual session data.

Which tool should you choose?


Choose Multiplayer if:

  • You need to fix bugs and resolve technical issues fast
  • Your support team regularly escalates issues to engineering
  • You need backend visibility alongside frontend data
  • You want flexible recording modes (not just always-on)
  • You need to correlate frontend and backend data without manual work
  • You have compliance requirements that need self-hosting
  • You want AI-native debugging workflows

Choose Fullstory if:

  • Your primary goal is user analytics and UX optimization
  • PM and design teams are your main users
  • You don't need backend data integrated with session replays
  • Always-on, frontend-only recording meets your needs

Consider both if:

  • You're a large organization where user analytics and technical debugging are handled by separate teams with separate objectives

The bottom line


Fullstory is a powerful behavioral analytics platform. But if you're using it to debug technical issues, you're working with one hand tied behind your back. You're missing backend data, manually correlating across tools, and creating slow handoffs between support and engineering.

Multiplayer gives you the complete picture: frontend and backend, correlated automatically, in a single timeline, with purpose-built collaboration for technical teams. It's session replay designed for the reality of modern distributed systems.

GETTING STARTED WITH MULTIPLAYER

👀 If this is the first time you’ve heard about Multiplayer, you may want to see full stack session recordings in action. You can do that in our free sandbox: sandbox.multiplayer.app

If you’re ready to trial Multiplayer you can start a free plan at any time 👇

Start a free plan
]]>
<![CDATA[Multiplayer sketches: annotating session recordings for better collaboration]]>https://www.multiplayer.app/blog/multiplayer-sketches-annotating-session-recordings-for-better-collaboration/690a76ebc1654f45dc223ac2Mon, 24 Nov 2025 07:48:00 GMT

Whiteboarding tools are indispensable in system design for visually conveying concepts, ideas, and rough plans. They tap into our natural preference for visual learning. Most people, after all, agree that "a picture is worth a thousand words."

But static whiteboarding tools lack the crucial element that makes feedback truly actionable: context.

That's why we evolved our Sketches feature into Annotations, a way to draw, write, and comment directly on top of full-stack session recordings. Now, instead of sketching ideas in isolation, teams can mark up actual user sessions, highlighting specific UI elements, API calls, and backend traces that need attention.

Why Annotate Session Recordings?


Multiplayer automatically captures everything happening in your application: frontend screens, user actions, backend traces, metrics, logs, and full request/response content and headers. But when something goes wrong or needs improvement, pointing at the exact moment and explaining what should change requires more than just text.

Annotations let you:

  • Draw directly on the replay with shapes, arrows, and highlights to mark problem areas or desired changes
  • Add on-screen text to explain intended behavior or specify new UI copy
  • Attach timestamp notes to clarify reproduction steps, requirements, or design intentions
  • Reference full-stack context by annotating user clicks, API calls, traces, and spans directly

Because Multiplayer auto-correlates frontend and backend data, your annotations aren't just surface-level markup: they're tied to the actual technical events that need investigation or modification.


How Support Teams Use Annotations


1. Clarifying Bug Reports

When a customer reports confusing behavior, support teams can create an annotated recording that shows:

  • Red circles highlighting where the UI behaved unexpectedly
  • Arrows pointing to the button that should have appeared
  • Text annotations explaining what the customer expected to see
  • Timestamp notes marking the exact API call that returned the wrong data

This annotated session becomes a complete bug report that engineering can understand immediately. No back-and-forth required.

2. Documenting Reproduction Steps

Instead of writing lengthy reproduction steps like "Click the dashboard, then filters, then date range, then apply," support can:

  • Record themselves reproducing the issue once
  • Add timestamp notes at key moments: "User opens filters here," "Selects invalid date range," "Error appears at 0:45"
  • Highlight the error message in red with a note: "This message is confusing. We should clarify valid date format"

Engineering gets a visual, interactive guide to the problem with full backend context included.

3. Collecting Feature Requests with Visual Context

When customers suggest improvements, support can annotate recordings to show:

  • Green highlights around areas customers want enhanced
  • Sketched mockups showing proposed layouts
  • Text annotations with customer quotes about desired behavior

How Engineering Teams Use Annotations


1. Reviewing PRs with Visual Feedback

During code review, engineers can record themselves testing a new feature and add annotations:

  • Yellow boxes around UI elements that need spacing adjustments
  • Arrows indicating where loading states should appear
  • Text specifying exact pixel values or color codes
  • Timestamp notes on API calls: "This endpoint takes 2.3s, should we add caching?"

The developer receives actionable visual feedback tied to actual runtime behavior, not abstract suggestions.

2. Debugging with Annotated Evidence

When investigating production issues, engineers can:

  • Record a session where the bug occurs
  • Circle the problematic UI element in red
  • Add arrows pointing from the frontend error to the failing API trace
  • Annotate the trace span with notes: "This database query times out under load"

This creates a self-documenting investigation that other team members can follow.

3. Planning Refactors with Visual Context

Before refactoring complex flows, teams can:

  • Record the current user journey
  • Use different colored annotations to map out different concerns (blue for performance, purple for UX improvements, orange for tech debt)
  • Add timestamp notes explaining why each step exists
  • Sketch the proposed new flow directly on top of the recording
  • Reference specific API calls and traces that will be affected

4. Onboarding New Engineers

Senior engineers can create annotated recordings that serve as interactive documentation:

  • Record a typical user flow
  • Add green annotations explaining key architectural decisions
  • Highlight important code paths with timestamp notes
  • Mark API boundaries and service interactions
  • Sketch out related system components and their relationships

New team members can pause, replay, and reference the full-stack context as they learn.


GETTING STARTED WITH MULTIPLAYER

👀 If this is the first time you’ve heard about Multiplayer, you may want to see full stack session recordings in action. You can do that in our free sandbox: sandbox.multiplayer.app

If you’re ready to trial Multiplayer you can start a free plan at any time 👇

Start a free plan
]]>
<![CDATA[Six best practices for backend design in distributed systems]]>https://www.multiplayer.app/blog/6-best-practices-for-backend-design-in-distributed-system/690a76ebc1654f45dc223aa6Thu, 20 Nov 2025 23:31:00 GMT

Most modern software systems are distributed systems. Designing and maintaining a distributed system, however, isn't easy. There are so many areas to master: communication, security, reliability, concurrency, and, crucially, observability and debugging.

When things go wrong (and they will, as we've seen recently and repeatedly), you need to understand what happened across your entire stack.

Here are six best practices to get you started:

(1) Design for failure (and debuggability)


Failure is inevitable in distributed systems. Most of us are familiar with the 8 fallacies of distributed computing, those optimistic assumptions that don't hold in the real world. Switches go down. Garbage collection pauses make leaders "disappear." Socket writes appear to succeed but have actually failed on some machines. A slow disk drive on one machine causes a communication protocol in the whole cluster to crawl.

Back in 2009, Google Fellow Jeff Dean cataloged the "Joys of Real Hardware," noting that in a typical year, a cluster will experience around 20 rack failures, 8 network maintenances, and at least one PDU failure.

Fast forward to 2025, and outages remain a fact of life.

The lesson? Design your system assuming it will fail, not hoping it won't. Build in graceful degradation, redundancy, and fault tolerance from the start.

But resilience isn't enough. You also need debuggability. When (not if) failures occur, your team needs answers fast:

  • What triggered the failure? The user action, the API call, the specific request that started the cascade
  • How did it propagate? Which services were involved, what data was passed between them, where did things go wrong
  • Why did it happen? The root cause, whether in your backend logic, database queries, or infrastructure layer

This requires capturing complete technical context, not just high-level signals. Aggregate metrics and sampled traces tell you something is wrong. Full context tells you exactly what went wrong and why.

Traditional monitoring gives you: "The system is slow."

What you actually need: "This specific user's checkout failed because the payment service timed out waiting for the inventory service, which was blocked on a slow database query."

The difference between these two statements is the difference between hours of investigation and minutes to resolution.

Visual representation of the 8 fallacies of distributed computing, by Denise Yu.

(2) Choose your consistency and availability models


Generally, in a distributed system, locks are impractical to implement and difficult to scale. As a result, you'll need to make trade-offs between the consistency and availability of data. In many cases, availability can be prioritized and consistency guarantees weakened to eventual consistency, with data structures such as CRDTs (Conflict-free Replicated Data Types).

It's also important to note that most modern systems use different models for different data. User profile updates might be eventually consistent, while financial transactions require strong consistency. Design your system with these nuances in mind rather than applying one model everywhere.
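
To make eventual consistency concrete, here's a minimal sketch of a G-Counter, one of the simplest CRDTs. Each replica increments only its own slot, and merging takes the element-wise maximum, so replicas converge no matter the order in which they sync. This is an illustrative toy (class and replica names are ours), not a production CRDT library:

class GCounter:
    """Grow-only counter: state is a map of replica id -> local count."""

    def __init__(self, replica_id):
        self.replica_id = replica_id
        self.counts = {}  # replica id -> increments seen at that replica

    def increment(self, amount=1):
        # A replica only ever writes to its own slot, so writes never conflict.
        self.counts[self.replica_id] = self.counts.get(self.replica_id, 0) + amount

    def value(self):
        # The logical counter value is the sum over all replicas.
        return sum(self.counts.values())

    def merge(self, other):
        # Element-wise max makes merge commutative, associative, and
        # idempotent, so replicas converge regardless of sync order.
        for rid, count in other.counts.items():
            self.counts[rid] = max(self.counts.get(rid, 0), count)

a, b = GCounter("replica-a"), GCounter("replica-b")
a.increment(3)  # each replica accepts writes locally, with no coordination
b.increment(2)
a.merge(b); b.merge(a)
assert a.value() == b.value() == 5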

A few more considerations:

Pay attention to data consistency: When researching which consistency model is appropriate for your system (and how to design it to handle conflicts and inconsistencies), review foundational resources like The Byzantine Generals Problem and the Raft Consensus Algorithm. Understanding these concepts helps you reason about what guarantees your system can actually provide and what it can't.

Strive for at least partial availability: You want the ability to return some results even when parts of your system are failing. The CAP theorem (Consistency, Availability, and Partition Tolerance) is well-suited for critiquing a distributed system design and understanding what trade-offs need to be made. Remember: out of C, A, and P, you can't choose CA. Network partitions will happen, so you're really choosing between consistency and availability when partitions occur.

(3) Build on a solid foundation from the start


Whether you're a pre-seed startup working on your first product, or an enterprise company releasing a new feature, you want to assume success for your project.

This means choosing the technologies, architecture, and protocols that will best serve your final product and set you up for scale. A little work upfront in these areas will lead to more speed down the line:

Security: A zero-trust architecture is the standard: assume breaches will happen and design accordingly to minimize your blast radius.

Containers: Some may still consider containers an advanced technique, but modern container runtimes have matured significantly, making containerization a default choice.

Orchestration: Reduce the operational overhead and automate many of the tasks involved in managing containerized applications. Kubernetes has become the de facto standard, but for smaller teams, managed container services (AWS ECS/Fargate, Google Cloud Run, Azure Container Apps) offer simpler alternatives without sacrificing scalability.

Infrastructure as code: Define infrastructure resources in a consistent and repeatable way, reducing the risk of configuration errors and ensuring that infrastructure is always in a known state. Tools like Terraform, Pulumi, and AWS CDK make infrastructure changes reviewable, testable, and version-controlled.
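
For illustration, here's a minimal infrastructure-as-code sketch using Pulumi's Python SDK, one of the tools named above. It assumes the pulumi and pulumi_aws packages are installed and AWS credentials are configured, and it runs via pulumi up; the resource name and tags are illustrative:

import pulumi
import pulumi_aws as aws

# Declaring the bucket in code makes the change reviewable in a PR,
# testable in CI, and reproducible across environments.
logs_bucket = aws.s3.Bucket(
    "app-logs",
    tags={"team": "platform", "environment": "staging"},
)

# Exported outputs can be consumed by other stacks and scripts.
pulumi.export("logs_bucket_name", logs_bucket.id)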

Standard communication protocols: REST, gRPC, GraphQL, and other well-established protocols simplify communication between different components and improve compatibility and interoperability. Choose protocols that match your use case: REST for simplicity, gRPC for performance, GraphQL for flexible client needs.

Observability from day one: Don't treat logging, metrics, and tracing as something you add later. Build observability into your system from the start, including structured logging, distributed tracing, and comprehensive session recording. When issues arise (and they will), having this context already in place is the difference between quick resolution and prolonged outages.
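
For example, even the first version of a service can emit structured logs and distributed traces. The sketch below wires up the OpenTelemetry Python SDK with an OTLP exporter; the service name and the place_order function are illustrative assumptions, and the exporter uses OpenTelemetry's default endpoint:

import logging

from opentelemetry import trace
from opentelemetry.exporter.otlp.proto.grpc.trace_exporter import OTLPSpanExporter
from opentelemetry.sdk.resources import Resource
from opentelemetry.sdk.trace import TracerProvider
from opentelemetry.sdk.trace.export import BatchSpanProcessor

# Name the service so traces from different components can be correlated.
provider = TracerProvider(resource=Resource.create({"service.name": "checkout"}))
provider.add_span_processor(BatchSpanProcessor(OTLPSpanExporter()))
trace.set_tracer_provider(provider)
tracer = trace.get_tracer(__name__)

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("checkout")

def place_order(order_id):
    # Every meaningful operation gets a span; attributes make it searchable.
    with tracer.start_as_current_span("place_order") as span:
        span.set_attribute("order.id", order_id)
        logger.info("placing order", extra={"order_id": order_id})
        # ... business logic; downstream calls inherit the trace context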

(4) Minimize dependencies


If the goal is to have a system that is resilient, scalable, and fault-tolerant, then you need to consider reducing dependencies with a combination of architectural, infrastructure, and communication patterns.

Service Decomposition: Each service should be responsible for a specific business capability, and they should communicate with each other using well-defined APIs. Start with a well-modularized monolith and extract services only when you have clear reasons (team autonomy, different scaling needs, technology requirements).

Organization of code: Choosing between a monorepo or polyrepo depends on your project requirements. Monorepos excel at atomic changes across services and shared tooling, while polyrepos provide stronger boundaries and independent versioning. Modern monorepo tools (Nx, Turborepo, Bazel) have made the monorepo approach increasingly viable even at large scale.

Service Mesh: A dedicated infrastructure layer for managing service-to-service communication provides a uniform way of handling traffic between services, including routing, load balancing, service discovery, and fault tolerance. Service meshes like Istio, Linkerd, and Consul add complexity (so evaluate carefully whether you actually need one!) but solve real problems at scale.

Asynchronous Communication: By using patterns like message queues and event streams, you can decouple services from one another. This reduces cascading failures: if one service is down, messages queue up rather than causing immediate failures. Tools like Kafka, RabbitMQ, and cloud-native options (AWS SQS, Google Pub/Sub) enable this decoupling.
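
As a minimal sketch of this decoupling, the producer below publishes an event to RabbitMQ via the pika client and returns immediately; a consumer can process it whenever it's healthy. The broker address, queue name, and payload are illustrative assumptions:

import json

import pika

connection = pika.BlockingConnection(pika.ConnectionParameters("localhost"))
channel = connection.channel()

# A durable queue survives broker restarts; messages wait if the consumer is down.
channel.queue_declare(queue="order-events", durable=True)

# The producer publishes and moves on; it never blocks on the consumer.
channel.basic_publish(
    exchange="",
    routing_key="order-events",
    body=json.dumps({"event": "order.created", "order_id": "ord_123"}),
    properties=pika.BasicProperties(delivery_mode=2),  # persist the message
)
connection.close()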

Circuit breakers and timeouts: Implement patterns that prevent cascading failures. When a downstream service is struggling, circuit breakers stop sending it traffic, giving it time to recover. Proper timeouts prevent one slow service from tying up resources across your entire system.
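
Here's a minimal circuit-breaker sketch of the pattern just described: after a run of failures the breaker opens and fails fast, then lets one trial call through once a reset timeout elapses. The thresholds and the commented usage are illustrative assumptions, not a production implementation (real libraries add richer half-open handling, metrics, and per-endpoint policies):

import time

class CircuitBreaker:
    def __init__(self, failure_threshold=5, reset_timeout=30.0):
        self.failure_threshold = failure_threshold  # failures before opening
        self.reset_timeout = reset_timeout          # seconds before a retry
        self.failures = 0
        self.opened_at = None  # None means the circuit is closed

    def call(self, func, *args, **kwargs):
        if self.opened_at is not None:
            if time.monotonic() - self.opened_at < self.reset_timeout:
                # Fail fast instead of piling load onto a struggling service.
                raise RuntimeError("circuit open: failing fast")
            self.opened_at = None  # half-open: allow one trial call

        try:
            result = func(*args, **kwargs)
        except Exception:
            self.failures += 1
            if self.failures >= self.failure_threshold:
                self.opened_at = time.monotonic()  # trip the breaker
            raise

        self.failures = 0  # a success closes the circuit again
        return result

# Hypothetical usage, pairing the breaker with a hard timeout:
# breaker = CircuitBreaker()
# breaker.call(requests.get, "https://inventory.internal/stock", timeout=2.0)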

(5) Monitor and measure system performance


In a distributed system, it can be difficult to identify the root cause of performance issues, especially when there are multiple systems involved.

Any developer can attest that "it's slow" is and will be one of the hardest problems you'll ever debug!

In recent years we've seen a shift from traditional Application Performance Monitoring (APM) to modern observability practices, as the need to identify and understand "unknown unknowns" becomes more critical.

Traditional APM tools excel at answering questions you already know to ask: "Is the database slow?", "What's the error rate?", etc. But they struggle with the unexpected, hard-to-reproduce issues that plague distributed systems. That's why modern observability focuses on capturing complete context about system behavior.

Rather than just collecting aggregate metrics and sampled traces, comprehensive observability tools capture:

  • Complete request traces across your entire distributed system, not just statistical samples
  • Full session context showing what users actually did, not just backend telemetry
  • Detailed interaction data including request/response payloads, database queries, and service call chains
  • Correlated frontend and backend behavior so you can see how user actions translate to system load

This approach shifts focus from reactive monitoring ("the system is down, what happened?") to proactive understanding ("why is this specific user experiencing slowness?"). Full stack session recordings exemplify this shift: they capture complete user journeys along with all the technical context needed to understand exactly what happened.

(6) Design dev-first debugging workflows


Most debugging workflows evolved accidentally. Support collects what they can from end-users. Escalation specialists add a few notes. Engineers get a ticket with partial logs, a vague user description, and maybe a screenshot or video recording.
Then the real work begins: clarifying, reproducing, correlating, guessing.

This is backward.

In modern distributed systems, developers are your most expensive, highest-leverage resource. Every minute they spend asking for missing context, grepping through log files, or reconstructing what happened is a minute they’re not fixing the problem, improving the system, or shipping value.

Dev-first debugging flips this model. Instead of assembling context, your tools should capture everything by default:

  • Exact user actions and UI state
  • Correlated backend traces, logs, and events
  • Request/response bodies and headers
  • Annotations, sketches, and feedback from all stakeholders

This eliminates the slowest, most painful part of every incident: figuring out what actually happened.

A dev-first debugging workflow ensures that the very first time an engineer opens a ticket, they already have the full picture. No Slack threads, no Zoom calls to “walk through what you saw,” no repeated requests for “more info,” no guesswork.

In 2025’s increasingly complex distributed environments, designing your debugging workflows around complete, structured, immediately available context is one of the highest-impact decisions you can make.


GETTING STARTED WITH MULTIPLAYER

👀 If this is the first time you’ve heard about Multiplayer, you may want to see full stack session recordings in action. You can do that in our free sandbox: sandbox.multiplayer.app

If you’re ready to trial Multiplayer you can start a free plan at any time 👇

Start a free plan
]]>
<![CDATA[High user satisfaction scores aren’t worth a burned-out team]]>https://www.multiplayer.app/blog/high-user-satisfaction-scores-arent-worth-a-burned-out-team/690a76ebc1654f45dc223ad7Mon, 17 Nov 2025 09:00:00 GMT

End-user support has always been messy. Manual steps, tool-switching, and scattered communication turn what should be a simple fix into a marathon of frustration.

Tickets feel like scavenger hunts: everyone’s searching for details, logs, screenshots, or that missing repro step.

Developers are left waiting on context that never arrives.

Support teams spend hours chasing updates across email threads and Slack channels.

And when context lives across ten tools, nobody gets the full story.

The result? High user satisfaction scores on paper and burned-out support and engineering teams behind them.

The usual support workflow


We’ve spoken with many of our users, and they usually describe a support workflow that is a multi-person, multi-email, multi-tool journey through their organization.

Here’s what “good” support often looks like in practice:

  1. User reports an issue. Some companies have one ticketing system, others accept chat, email, bug forms, even social DMs. The same problem can arrive through multiple channels, which already splits the context.
  2. End-user <> Support back-and-forth. The first ticket rarely contains everything a Support Engineer or Developer needs to fully understand the problem. You might get a symptom, a screenshot, or “it’s slow”. Support starts a back-and-forth to collect the missing pieces: steps to reproduce, browser and device, account, exact time, error messages. This can take hours or days as people reply in different time zones or forget details.
  3. Handoff to engineering. With partial context gathered, Support passes the ticket along. Engineering reads the summary, opens the attachments, and tries to map the user’s description to how the system actually works. In many instances, reproduction attempts fall short: something is still missing, so developers need to ask Support for more info, or they go hunting for more details on their own.
  4. Side quests multiply. Engineers search logs, traces, dashboards, and metrics. They check feature flags, versions, deployments, and documentation. They ask teammates if anyone touched that part of the system. Context is scattered across tools and people, which slows everything down.
  5. Ticket escalations. While Engineering investigates, the customer grows impatient and escalates to the Account Executive or to leadership. Inside the company, the ticket rises to Senior Engineers and other teams. Meetings are scheduled to “sync on status,” which consumes time without adding new facts.
  6. Stakeholder pile-on. Product, CS, QA, UX, and Sales all weigh in, each bringing partial data or new questions. Information fragments across email, Slack, comments, and docs. Keeping track of what is known becomes a task of its own.

Sometimes this results in user churn. Many times, however, the story appears to end well!

The customer finally gets the solution to their problem, leaves a nice review, maybe even a five-star CSAT (customer satisfaction) rating. On paper, it’s a success: the ticket is closed, the metrics look good, and everyone moves on.

But underneath, there’s a quiet cost. The team lost hours chasing context instead of solving problems. Engineers spent more time coordinating than coding. Support burned emotional energy just keeping the process afloat. And none of that shows up on your dashboards.

Is this type of “great support” sustainable?


What happens when your user base doubles, or when the engineering roadmap shifts and the same few people can’t jump on every urgent ticket? How do you maintain speed and quality without burning through your team’s time — and energy — every time an issue hits production?

A high CSAT score is easy to celebrate. But if it comes at the expense of your team’s focus, efficiency, and well-being, it’s not a sign of healthy support.


The solution


Multiplayer transforms the chaos of debugging and support workflows. We do this through full stack session recordings: complete, correlated replays that capture everything from the frontend to the backend.

That includes:

  • User actions, on-screen behavior, clicks, and comments
  • Frontend data: DOM events, network requests, browser metadata, HTML source code
  • Backend traces with zero sampling, logs, request/response content, and headers
  • CS, developer, and QA annotations, comments, and sketches

Instead of scattered tooling and guesswork, Multiplayer gives you one replay that tells the whole story: what happened, why it happened, and how to fix it.


Support engineers get clarity instead of chaos


Instead of chasing screenshots and guessing what really happened, support engineers open a ticket and see the full story: user actions, backend behavior, and all the technical details automatically captured.

No scavenger hunts, no Slack threads, no “please send more info.” Just instant visibility and a single source of truth everyone can work from.

They get:

  • Automatic issue capture the moment something goes wrong
  • A clear replay of user actions, clicks, and feedback
  • A single shareable link for cross-team collaboration, complete with on-screen sketches, notes, and context

Developers get everything they wish every ticket included


By the time an issue reaches the Engineers, it already includes everything they need to reproduce and fix it. Multiplayer captures frontend and backend data, correlates it by session, and makes it available directly inside their IDE or AI tools.

No more grepping through logs or asking for repro steps. Just open the session, see what happened, and fix it.

They get:

  • High-fidelity repro steps with full visibility into cause and effect
  • Complete, unsampled full-stack data that’s already AI-ready
  • Support for any language, environment, or architecture

Users get faster fixes and smoother experiences


From the user’s perspective, the magic is simple: they report an issue once, and it gets resolved quickly and accurately.

No endless email loops. No repeated questions. Just responsive, reliable support that feels effortless.

They get:

  • Easy issue reporting through browser extension or in-app widget
  • Fast, accurate resolutions even for rare, intermittent, or hard-to-reproduce bugs
  • A better in-app experience, because every fix is grounded in real user data

Why it matters

When Support and Engineering are on the same page, everything moves faster. Users feel heard, Support teams stop chasing screenshots, and Developers can finally focus on building instead of firefighting.

Multiplayer was built for that kind of alignment: turning fragmented communication into shared understanding. We:

  • Eliminate incomplete bug reports and endless back-and-forth
  • Give every team the same full-stack context from the start
  • Speed up escalations and shorten resolution time
  • Surface root causes in minutes instead of hours (days?)
  • Improve overall product quality with every fix

With Multiplayer, the session itself becomes the common language between end-users, Support, and Engineering.

No lost context. No burnout. Just clear visibility, faster fixes, and teams that can finally breathe again.


GETTING STARTED WITH MULTIPLAYER

👀 If this is the first time you’ve heard about Multiplayer, you may want to see full stack session recordings in action. You can do that in our free sandbox: sandbox.multiplayer.app

If you’re ready to trial Multiplayer you can start a free plan at any time 👇

Start a free plan
]]>
<![CDATA[Full stack session recordings: end-to-end visibility in a single click]]>https://www.multiplayer.app/blog/full-stack-session-recordings-end-to-end-visibility-in-a-single-click/690a76ebc1654f45dc223ad4Mon, 10 Nov 2025 07:01:00 GMT

A full stack session recording captures the entire path of system behavior (from the frontend screen to the backend services) automatically correlated, enriched, and AI-ready.

In a single replay, you see:

  • The screens and actions a user took
  • The backend traces and logs triggered by those actions
  • The request/response content and headers exchanged between all your components
  • Metadata like device, environment, browser, etc.
  • User feedback, plus annotations, sketches, and comments from the engineering team

Instead of stitching together screenshots, console logs, APM traces, and bug tickets, a full stack session recording shows you the whole story, end to end.

Traditional session replay tools


Most tools that promise “visibility” into your system fall into one of three buckets:

  • Frontend session recorders (e.g. FullStory, LogRocket, Jam): Great at showing what the user saw and clicked, but they stop at the browser. Some bolt on backend visibility via integrations, which means extra cost, more tools to manage, and sampled data.
  • Error monitoring tools (e.g. Sentry): Useful for flagging what broke, but they're not purpose-built for collaborative debugging workflows. When you need to resolve a specific technical issue, you're left sifting through sampled session replays, manually correlating disconnected context from separate tools, and coordinating slow handoffs between support, frontend, and backend teams.
  • APM/observability platforms (e.g. Datadog, New Relic): Perfect for monitoring system health and long-term trends, but not for surgical, step-by-step debugging through a session replay.

How is Multiplayer different


Multiplayer is different. It gives developers the entire story in one session, so you don’t waste hours context-switching between tools, grepping through logs, or chasing repro steps.

✔️ Compatible with any existing ticketing/help desk system (e.g. Zendesk, Intercom, Jira)

✔️ Multiple options to record, install, and integrate session replays. Multiplayer adapts to your support workflow.

✔️ Developer-friendly and AI-native. Compatible with any observability platform, language, environment, architecture, and AI tool. You can also host in the cloud or self-host.


Everything you need, for any support scenario, out of the box


Multiplayer adapts to every support workflow. No extra tools, no manual workarounds, no rigid setup. Whether you’re handling a question about “unexpected behavior” or a complex cross-service incident, Multiplayer gives you the full context to resolve it.



What makes full stack session recordings powerful?

Where traditional replays stop at the UI, full stack session recordings go deeper, capturing the entire stack, automatically.

Multiplayer makes that power practical.

With Multiplayer, a single session replay isn’t just a playback of what happened: it’s a complete, actionable view of your system that accelerates debugging, validates fixes, and fuels development.


GETTING STARTED WITH MULTIPLAYER

👀 If this is the first time you’ve heard about Multiplayer, you may want to see full stack session recordings in action. You can do that in our free sandbox: sandbox.multiplayer.app

If you’re ready to trial Multiplayer you can start a free plan at any time 👇

Start a free plan
]]>
<![CDATA[Collect what matters: how Multiplayer stays lightweight without losing context]]>https://www.multiplayer.app/blog/collect-what-matters-how-multiplayer-stays-lightweight-without-losing-context/690a76ebc1654f45dc223ad5Mon, 03 Nov 2025 18:29:00 GMT

Traditional "always-on" recording tools and APM platforms take the same brute-force approach: capture everything. Every session, every log, every metric, whether you need it or not. That flood of data creates its own problems: high storage costs, constant filtering and sampling, and hours wasted sifting for the signal inside the noise.

Multiplayer was built differently. We capture only what matters, and we do it in a way that stays lightweight and unobtrusive for users. When you need the full picture for a specific technical issue or complex, full stack bug, you have everything, correlated in one timeline.

Multiplayer setup 101


Multiplayer is designed to adapt to every support and debugging workflow, which is why we support multiple installation options, recording modes, and backend configurations.

It's a "choose-your-own-adventure" approach: teams can mix and match the install options, recording modes, and backend configuration that best fit their application's needs.

How Multiplayer stays lean


Modern teams are rightly sensitive to anything that could slow users down. Multiplayer is designed to capture useful context without adding noticeable latency or chewing through bandwidth/CPU. Here’s how we stay lean:

  • Opt-in by default → If you’re not recording, there’s zero runtime overhead. Browser extension off? In-app widget not initialized? No impact.
  • Event-based, not video-based → We capture structured events (DOM mutations, clicks, network metadata), not pixel streams. The result: smaller payloads, faster uploads, less CPU.
  • Session-first, not “capture everything” → Multiplayer correlates full-stack data around the sessions you care about, instead of hoovering telemetry from your entire estate.
  • Asynchronous, batched I/O → Uploads happen in the background, off the critical path. No blocking calls that slow users down.
  • Backend-agnostic via OpenTelemetry → You control what’s instrumented and how much you emit, just like structured logging (see the sketch below).
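
For example, here's what that control looks like with the standard OpenTelemetry Python API. This is a minimal sketch: the tracer, span, and attribute names are illustrative choices, not a Multiplayer-specific interface.

from opentelemetry import trace

# Standard OpenTelemetry API: only the operations you wrap emit
# telemetry, so you decide the volume and detail, just like
# structured logging.
tracer = trace.get_tracer("checkout-service")

def process_checkout(cart_id: str, total_cents: int):
    with tracer.start_as_current_span("checkout.process") as span:
        # Attach only the attributes you actually want to emit.
        span.set_attribute("cart.id", cart_id)
        span.set_attribute("cart.total_cents", total_cents)
        # ... business logic here ...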

Recording modes: precision vs coverage


Most tools force a tradeoff: either record every session (expensive and noisy) or rely on on-demand captures (easy to miss unexpected issues).

Multiplayer gives you three recording modes that can be combined depending on your workflow:

  • On-Demand: Nothing runs until you explicitly start recording via extension, widget, or SDK. Perfect when you want zero background footprint.
  • Continuous: A lightweight rolling buffer (a few minutes of events) that auto-saves on errors/exceptions or when you choose to save, so you catch elusive bugs without recording everything (see the sketch below).
  • Conditional: The closest to traditional "always-on" session capture, except that you pre-select the specific conditions that trigger recording. In short, you record every session, but only for a specific cohort of users.

This versatile approach to choosing and combining recording modes gives you coverage without drowning in noise or adding unnecessary performance overhead.
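
To make the Continuous mode concrete, here's a minimal conceptual sketch in Python of a rolling buffer that saves its window only when something goes wrong. The class and uploader are hypothetical, illustrating the mechanism rather than the actual Multiplayer SDK.

from collections import deque

def upload_session(events: list, reason: str):
    # Placeholder for a batched, background upload (hypothetical).
    print(f"saving {len(events)} events ({reason})")

class RollingRecorder:
    def __init__(self, max_events: int = 5000):
        # Old events fall off the left automatically, so memory
        # stays bounded no matter how long the session runs.
        self.buffer = deque(maxlen=max_events)

    def record(self, event: dict):
        self.buffer.append(event)

    def on_error(self, error: Exception):
        # Flush the buffered window in one batch; in a real recorder
        # this happens asynchronously, off the critical path.
        upload_session(list(self.buffer), reason=repr(error))
        self.buffer.clear()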

Installation options: full control


Different teams have different needs, so Multiplayer supports multiple install paths (browser extension, in-app widget, or SDK) that let you control overhead and scope.

We also offer self-hosted deployments (contact our team for more information).

Why our approach matters


Multiplayer gives you the control to record what you need, when you need it, without drowning in data or slowing down your users.

Whether you’re debugging, testing, supporting customers, or feeding AI copilots accurate context, you get the same promise: all the context, none of the noise.


GETTING STARTED WITH MULTIPLAYER

👀 If this is the first time you’ve heard about Multiplayer, you may want to see full stack session recordings in action. You can do that in our free sandbox: sandbox.multiplayer.app

If you’re ready to trial Multiplayer you can start a free plan at any time 👇

Start a free plan
]]>
<![CDATA[From session replay to development plan: annotations in full stack session recordings]]>https://www.multiplayer.app/blog/from-session-replay-to-development-plan-annotations-in-full-stack-session-recordings/690a76ebc1654f45dc223ad2Mon, 27 Oct 2025 17:50:00 GMT

Traditional session replay tools give you a window into what the user saw.

A few let you blur sensitive data or leave a quick sketch. Some rely on third-party integrations to manage annotations. Most just let you add comments to the overall recording.

What they don’t give you is a way to connect annotations to the actual system data: the API calls, traces, and logs that explain what really happened. And they certainly don’t make those annotations AI-ready, so you can feed them straight into your IDE or coding assistant.

That’s where Multiplayer annotations come in.

Practical use cases


Notes in Multiplayer transform raw session recordings into executable development plans. Whether you're debugging a technical issue, clarifying requirements, planning a refactor, or designing a new feature, notes capture your thinking directly on the timeline, attached to the exact moments, interactions, and backend events that matter.

Instead of writing requirements in a vacuum or describing bugs in abstract terms, you're annotating actual behavior with full-stack context automatically included.

(1) From Replay to Plan


This is what a traditional workflow might look like:

  • Watch a session recording of the user's actions (frontend data only)
  • Switch to a separate tool (Jira, Linear, Notion)
  • Try to describe what you saw in text
  • Lose technical context in translation
  • Respond to all the clarifying questions
  • Repeat the cycle

With Multiplayer annotations:

  • Watch the full stack session recording
  • Sketch directly on problem areas as you see them
  • Add timestamp notes explaining what should happen instead
  • Annotate the specific API call or trace that needs modification
  • Share the annotated recording with your team

The result is precise, contextualized instructions that include:

  • Visual markup showing exactly which UI elements need changes
  • Timestamp notes explaining the intended behavior at each step
  • References to the actual API calls, database queries, and service traces involved
  • On-screen text specifying new copy, error messages, or validation rules
  • Sketched mockups showing proposed layouts or flows

Engineers receive a complete specification with runtime context.


(2) Cross-Role Collaboration


Annotations create a shared visual language that works across teams and disciplines:

Support → Engineering handoffs: Support annotates a customer's session with red circles around the confusing UI, timestamp notes explaining what the customer tried to do, and highlights on the error response that needs better messaging. Engineering sees the bug with full reproduction context in under a minute.

Product → Engineering workflows: PM annotates a user session showing where people drop off, adds sketches proposing a new flow, and attaches notes with acceptance criteria. Engineer reviews the annotated session and knows exactly what to build, with examples of the current behavior and references to the code paths involved.

QA → Development feedback loops: QA records a test run, annotates edge cases with highlights, adds notes documenting each test scenario, and circles areas where behavior differs from specs. Developers receive visual test documentation tied to actual execution traces.

Engineering → Vendor communications: When working with third-party APIs or external teams, engineers can record integration behavior, annotate failing requests with technical details, sketch expected responses, and share the annotated session. Vendors see exactly what's happening in your system without needing access to your codebase.


AI-Ready Context


Use the Multiplayer MCP server to pull your full stack session recording screenshots and notes into your AI coding tools.

Because annotations carry metadata, they’re machine-readable. They’re not just helpful for humans, they’re structured context your AI tools can consume directly.

This means your copilot doesn’t just “see” a session: it understands the requirements, context, and team intent tied to that session. From there, it can generate accurate fixes, tests, or even implement new features with minimal prompting.
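
As an illustration, a client (your AI tool, or your own script) could connect to the server and discover the context it exposes using the official MCP Python SDK. This is a sketch: the launch command and the tools you'd find are assumptions for illustration, so check the Multiplayer docs for the real interface.

import asyncio
from mcp import ClientSession, StdioServerParameters
from mcp.client.stdio import stdio_client

# Hypothetical launch command for the Multiplayer MCP server.
server = StdioServerParameters(command="npx", args=["@multiplayer-app/mcp"])

async def main():
    async with stdio_client(server) as (read, write):
        async with ClientSession(read, write) as session:
            await session.initialize()
            # Discover the tools the server exposes (e.g. fetching
            # session screenshots, notes, or traces) so an AI coding
            # tool can pull that context on demand.
            tools = await session.list_tools()
            for tool in tools.tools:
                print(tool.name, "-", tool.description)

asyncio.run(main())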

Traditional AI prompting:

"Add validation to the signup form"

AI generates generic validation without knowing your form structure, existing patterns, or backend constraints.

AI prompting with annotated sessions:

Share an annotated Multiplayer recording showing:
- The signup form with red circles around fields needing validation
- Timestamp note: "Email validation should reject addresses without proper domains"
- Highlighted API call showing the current /signup endpoint contract
- Text annotation specifying error message copy
- Trace showing the validation happens client-side only (needs backend validation too)

AI now has:

  • Visual context of your actual UI
  • Specifications for the exact behavior you want
  • Technical context of existing API contracts
  • Requirements for both frontend and backend changes
  • Examples of current behavior vs. desired behavior

GETTING STARTED WITH MULTIPLAYER

👀 If this is the first time you’ve heard about Multiplayer, you may want to see full stack session recordings in action. You can do that in our free sandbox: sandbox.multiplayer.app

If you’re ready to trial Multiplayer you can start a free plan at any time 👇

Start a free plan
]]>