{"id":13555,"date":"2026-03-16T14:59:32","date_gmt":"2026-03-16T14:59:32","guid":{"rendered":"https:\/\/www.d-id.com\/?p=13555"},"modified":"2026-03-16T14:59:37","modified_gmt":"2026-03-16T14:59:37","slug":"v4-expressive-visual-agents","status":"publish","type":"post","link":"https:\/\/www.d-id.com\/blog\/v4-expressive-visual-agents\/","title":{"rendered":"Introducing V4 Expressive Visual Agents"},"content":{"rendered":"\n<p>Real-time, emotionally intelligent conversations. Built for product-grade scale.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-key-takeaways\"><strong>Key Takeaways<\/strong><\/h2>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>V4 Expressive Visual Agents bring emotion into live, two-way conversations<\/strong>\u2014not just pre-rendered videos. They combine expressive digital humans with an LLM \u201cbrain\u201d for real dialogue streamed in real time via WebRTC.<\/li>\n\n\n\n<li><strong>They\u2019re designed for \u201cface-to-face\u201d interaction at low latency<\/strong>, so the experience feels like a conversation, not a sequence of delayed clips.<\/li>\n\n\n\n<li><strong>You can define avatar, voice, and agent behavior in one setup<\/strong>, then deploy across use cases like support, training, internal comms, and marketing flows.<\/li>\n\n\n\n<li><strong>They\u2019re measurable by default<\/strong>: export conversation logs as structured JSON for analytics, QA, and product iteration.<\/li>\n<\/ul>\n\n\n<section class=\"c-block c-margin c-margin--top-default c-margin--bottom-default c-padding--top-default c-padding--bottom-default c-paddingm--top-default c-paddingm--bottom-default c-block b-video b-video--page-v4-expressive-visual-agents  align b-video-layout-default b-video--layout-default b-video-style-default\" id=\"b-video-1\">\n\t<div class=\"c-background c-background--container\" style=\"--bg-color: \">\n    \n    \n    \t    <div class=\"c-background__content\">\n\t\t\t<div class=\"container\">\n\t\t\t\t\t\t\t\t\t<div class=\"c-video c-video--source-embed\">\n\t\n\t\n\t\t\t\t<div class=\"c-embed\">\n\t\t<div style=\"padding:56.25% 0 0 0;position:relative;\"><iframe src=\"https:\/\/player.vimeo.com\/video\/1172575053?h=081804dbe1&amp;badge=0&amp;autopause=0&amp;player_id=0&amp;app_id=58479&amp;autoplay=1&amp;loop=1\" frameborder=\"0\" allow=\"autoplay; fullscreen; picture-in-picture; clipboard-write; encrypted-media; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" style=\"position:absolute;top:0;left:0;width:100%;height:100%;\" title=\"Expressive Agents Promo\"><\/iframe><\/div><script src=\"https:\/\/player.vimeo.com\/api\/player.js\"><\/script>\n\t<\/div>\n\t\t\t\n\t\n<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t<\/div>\n<\/section>\n\n\n\n<p>Digital humans have already proven their value in business communication: faster content production, consistent messaging, scalable localization, and always-on presence. But the moment you move from \u201cpresenting\u201d to \u201cconversing,\u201d the bar changes. Users don\u2019t just watch. They interrupt. They ask follow-ups. They challenge assumptions. They expect the response to land with the right tone\u2014and to arrive fast.<\/p>\n\n\n\n<p>That\u2019s where V4 Expressive Visual Agents come in. They take the emotional control and realism of expressive avatars and extend it into <strong>real-time, interactive experiences<\/strong>\u2014streamed live, powered by an LLM, and built to slot into real customer journeys (web, apps, kiosks, internal portals) rather than living as a demo.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"800\" height=\"304\" src=\"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-7.png\" alt=\"\" class=\"wp-image-13586\" srcset=\"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-7.png 800w, https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-7-300x114.png 300w, https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-7-768x292.png 768w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-why-emotional-intent-drives-business-roi\"><strong>Why Emotional Intent Drives Business ROI<\/strong><\/h2>\n\n\n\n<p>In business, \u201cemotion\u201d is not about theatrics. It\u2019s about clarity and trust. The same sentence can reassure or escalate depending on how it\u2019s delivered. In high-stakes moments\u2014support, billing, onboarding, healthcare, financial decisions\u2014tone is part of the product.<\/p>\n\n\n\n<p>Now add the conversational layer. In live interactions, emotion becomes even more consequential because the user is reacting in the moment. If the agent feels flat, robotic, or \u201coff,\u201d the user disengages. If it feels aligned\u2014confident when it should be, empathetic when it needs to be, crisp when it\u2019s time to move\u2014the conversation becomes easier to follow, more credible, and more likely to end in resolution.<\/p>\n\n\n\n<p>V4 Expressive Visual Agents are built around that idea: <strong>the face, the voice, and the response timing need to work together<\/strong>\u2014in real time.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"800\" height=\"304\" src=\"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-8.png\" alt=\"\" class=\"wp-image-13588\" srcset=\"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-8.png 800w, https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-8-300x114.png 300w, https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-8-768x292.png 768w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-what-makes-v4-expressive-visual-agents-different\"><strong>What Makes V4 Expressive Visual Agents Different<\/strong><\/h2>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-expression-based-on-real-human-performance\"><strong>Expression Based on Real Human Performance<\/strong><\/h3>\n\n\n\n<p>The goal isn\u2019t to \u201cadd emotions.\u201d It\u2019s to enable believable delivery that matches intent. V4\u2019s expressive stack is designed for controllability and realism, so the agent can consistently convey the emotional posture you want\u2014across a full response, not just a single word or moment.<\/p>\n\n\n\n<p>In practice, this is what turns an agent from \u201ctalking head\u201d into a presence that feels capable of handling real conversations.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-natural-timing-lip-sync-and-turn-taking\"><strong>Natural Timing, Lip Sync, and Turn-Taking<\/strong><\/h3>\n\n\n\n<p>In real-time conversations, timing <em>is<\/em> UX. A great answer delivered too late (or with awkward pacing) doesn\u2019t feel great anymore.<\/p>\n\n\n\n<p>V4 Expressive Visual Agents are built to support live dialogue\u2014where the response is generated by an LLM and then performed on an avatar with natural pacing and synchronized speech-to-face animation. The experience is streamed as a real-time session, so it feels like an interaction rather than a render pipeline.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-voice-visuals-and-reasoning-developed-as-one-system\"><strong>Voice, Visuals, and Reasoning Developed as One System<\/strong><\/h3>\n\n\n\n<p>A visual agent is not \u201can avatar\u201d plus \u201ca chatbot.\u201d It\u2019s a system that has to orchestrate conversation flow, preserve context, and translate a response into speech and performance\u2014continuously.<\/p>\n\n\n\n<p>With D-ID Agents, you configure the LLM as the agent\u2019s brain (built-in models, external provider keys, or a custom OpenAI-compatible endpoint), and D\u2011ID handles conversation flow and message history routing.<\/p>\n\n\n\n<p>You also define the avatar and voice as part of the same agent configuration, so behavior and presentation stay aligned.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-real-time-streaming-that-s-product-ready-not-a-prototype\"><strong>Real-Time Streaming That\u2019s Product-Ready (Not a Prototype)<\/strong><\/h3>\n\n\n\n<p>V4 Expressive Visual Agents are delivered as <strong>real-time sessions<\/strong> using the D-ID Client SDK, which handles WebRTC streaming and provides a simple chat interface.<\/p>\n\n\n\n<p>That matters because the \u201cagent experience\u201d is not just model quality\u2014it\u2019s the entire interaction loop: connection, latency, turn-taking, and reliability.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"800\" height=\"304\" src=\"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-9.png\" alt=\"\" class=\"wp-image-13590\" srcset=\"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-9.png 800w, https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-9-300x114.png 300w, https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-9-768x292.png 768w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-how-expressive-visual-agents-are-used\"><strong>How Expressive Visual Agents Are Used<\/strong><\/h2>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-creating-an-expressive-visual-agent\"><strong>Creating an Expressive Visual Agent<\/strong><\/h3>\n\n\n\n<p>At a high level, you\u2019re defining three things: <strong>how the agent looks<\/strong>, <strong>how it sounds<\/strong>, and <strong>how it behaves<\/strong>.<\/p>\n\n\n\n<p>A typical setup flow looks like this:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li><strong>Choose an avatar\/presenter<\/strong> (the \u201cface\u201d) and define the default presence (idle behavior, visual style).<\/li>\n\n\n\n<li><strong>Select a voice<\/strong> that matches your brand and audience.<\/li>\n\n\n\n<li><strong>Choose the LLM configuration<\/strong> (built-in, external keys, or custom) and write the agent\u2019s instructions (role, tone, boundaries).<\/li>\n\n\n\n<li><strong>Optional but powerful: add a knowledge base<\/strong> (RAG) so the agent answers using your documents, policies, and product info.<\/li>\n<\/ol>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-running-real-time-agent-sessions\"><strong>Running Real-Time Agent Sessions<\/strong><\/h3>\n\n\n\n<p>Once your agent exists, you can bring it to life in a live environment.<\/p>\n\n\n\n<p>The real-time path is straightforward:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Create a <strong>client key<\/strong> (domain-restricted for frontend usage).<\/li>\n\n\n\n<li>Use the <strong>D\u2011ID Client SDK<\/strong> to connect a video element and initiate a WebRTC session.<\/li>\n\n\n\n<li>Send messages via chat() for normal conversation, or speak() when you want the agent to deliver a specific scripted line.<\/li>\n<\/ul>\n\n\n\n<p>That\u2019s the core difference versus expressive avatar videos: <strong>Visual Agents are designed for live, two-way interaction<\/strong>, not one-way playback.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"800\" height=\"304\" src=\"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-10.png\" alt=\"\" class=\"wp-image-13592\" srcset=\"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-10.png 800w, https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-10-300x114.png 300w, https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-10-768x292.png 768w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-top-business-applications-for-emotionally-intelligent-visual-agents\"><strong>Top Business Applications for Emotionally Intelligent Visual Agents<\/strong><\/h2>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-learning-and-development\"><strong>Learning and Development<\/strong><\/h3>\n\n\n\n<p><strong>Application:<\/strong> interactive onboarding, scenario training, roleplay coaching<br><strong>The V4 advantage:<\/strong> learners can ask questions mid-flow, get clarifications instantly, and practice realistic conversations with an agent that can hold tone\u2014supportive, firm, encouraging\u2014without breaking character.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-marketing-and-sales\"><strong>Marketing and Sales<\/strong><\/h3>\n\n\n\n<p><strong>Application:<\/strong> website agents for product discovery, qualification, and conversion support<br><strong>The V4 advantage:<\/strong> instead of a static explainer or a text chat bubble, visitors can talk to a face that answers questions in real time\u2014confident when presenting value, curious when qualifying, and concise when guiding to the next step.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-internal-and-leadership-communication\"><strong>Internal and Leadership Communication<\/strong><\/h3>\n\n\n\n<p><strong>Application:<\/strong> internal comms agents, policy assistants, IT\/HR portals, leadership Q&amp;A<br><strong>The V4 advantage:<\/strong> employees get answers quickly, but the delivery also matters: clear when sharing policy, empathetic during change management, and calm during high-pressure moments.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-customer-support\"><strong>Customer Support<\/strong><\/h3>\n\n\n\n<p><strong>Application:<\/strong> front-line triage, guided troubleshooting, account\/billing support, escalation routing<br><strong>The V4 advantage:<\/strong> support is where tone and speed are most tightly coupled. A well-tuned visual agent can reduce friction by acknowledging the user\u2019s state, walking them through resolution steps, and escalating gracefully when needed\u2014while still feeling human and present.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"800\" height=\"304\" src=\"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-11.png\" alt=\"\" class=\"wp-image-13594\" srcset=\"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-11.png 800w, https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-11-300x114.png 300w, https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-11-768x292.png 768w\" sizes=\"(max-width: 800px) 100vw, 800px\" \/><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"h-why-expressive-visual-agents-matter-now-scaling-without-flattening\"><strong>Why Expressive Visual Agents Matter Now: Scaling Without Flattening<\/strong><\/h2>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-extending-the-human-reach\"><strong>Extending the Human Reach<\/strong><\/h3>\n\n\n\n<p>Teams are being asked to do more with less: more channels, more languages, more personalization, more support coverage. Visual Agents help scale presence without scaling headcount\u2014but <em>only<\/em> if the experience feels credible enough to represent your brand.<\/p>\n\n\n\n<p>That\u2019s why expressiveness matters. It\u2019s what keeps a scaled interaction from feeling like a downgrade.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"h-the-missing-piece-of-the-digital-puzzle\"><strong>The Missing Piece of the Digital Puzzle<\/strong><\/h3>\n\n\n\n<p>We\u2019ve had chatbots. We\u2019ve had avatars. We\u2019ve had LLMs. The leap is bringing them together into a live experience that feels like a conversation: low-latency streaming, consistent personality, controllable delivery, and knowledge-grounded answers.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"798\" height=\"339\" src=\"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-12.png\" alt=\"\" class=\"wp-image-13596\" srcset=\"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-12.png 798w, https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-12-300x127.png 300w, https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-12-768x326.png 768w\" sizes=\"(max-width: 798px) 100vw, 798px\" \/><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Ready to Humanize Your Digital Conversations?<\/strong><\/h2>\n\n\n\n<p>If you\u2019re building real-time customer experiences, internal support tools, or interactive training, V4 Expressive Visual Agents are designed to help you deploy a digital human that can actually hold a conversation\u2014fast, expressive, and measurable.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img decoding=\"async\" width=\"798\" height=\"339\" src=\"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-13.png\" alt=\"\" class=\"wp-image-13598\" srcset=\"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-13.png 798w, https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-13-300x127.png 300w, https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-13-768x326.png 768w\" sizes=\"(max-width: 798px) 100vw, 798px\" \/><\/figure>\n\n\n<section class=\"c-block c-margin c-margin--top-default c-margin--bottom-default c-padding--top-default c-padding--bottom-default c-paddingm--top-default c-paddingm--bottom-default c-block b-accordion b-accordion--page-v4-expressive-visual-agents  align b-accordion-layout-default b-accordion--layout-default b-accordion-style-default\" id=\"b-accordion-1\">\n\t<div class=\"c-background c-background--container\" style=\"--bg-color: \">\n    \n    \n    \t    <div class=\"c-background__content\">\n\t\t\t<div class=\"container\">\n\t\t\t<div class=\"b-accordion__inner has-accordion-default-color\">\n\t\t\t\t\t\t\t\t\t<header class=\"c-section-header\">\n\t\t\t\t<h2 class=\"c-el c-title c-section-header__title default\">\n\t<b>FAQs<\/b>\n<\/h2>\n\t\t\t<\/header>\n\t\t\t\t\n\t\t\t\t\n\t\t\t\t<div class=\"c-accordion\" data-type=\"single\" data-open-first=\"true\">\n\t\t<ul class=\"c-accordion__items\">\n\t\t\t\n\t\t\t\t\t\t\t<li class=\"c-accordion__item\"\n\t\t\t\t\tid=\"c-accordion__item-0\"\n\t\t\t\t\tdata-id=\"c-accordion__item-0\"\n\t\t\t\t>\n\t\t\t\t\t\n\t\t\t\t\t<h3 class=\"c-el c-title-button c-accordion__item-head default\">\n\t<button \n\t\t\t\t\t\t\tid=\"c-accordion-item-head-0\"\n\t\t\t\t\t\t\taria-controls=\"c-accordion-item-panel-0\"\n\t\t\t\t\t\t\taria-expanded=\"true\"\n\t\t\t\t\t\t>\n\t\t<b>What is a V4 Expressive Visual Agent?\n<\/b>\n\t\t<svg width=\"20\" height=\"21\" viewBox=\"0 0 20 21\" fill=\"none\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" aria-hidden=\"true\" focusable=\"false\" role=\"presentation\">\n\t\t\t\t\t\t\t<line x1=\"20\" y1=\"10.5\" x2=\"-8.74228e-08\" y2=\"10.5\" stroke=\"#090604\" stroke-width=\"2\"\/>\n\t\t\t\t\t\t\t<line x1=\"10\" y1=\"20.5\" x2=\"10\" y2=\"0.5\" stroke=\"#090604\" stroke-width=\"2\"\/>\n\t\t\t\t\t\t<\/svg>\n\t<\/button>\n<\/h3>\n\n\t\t\t\t\t<div\n\t\t\t\t\t\tid=\"c-accordion-item-panel-0\"\n\t\t\t\t\t\tclass=\"c-accordion__item-body\"\n\t\t\t\t\t\trole=\"region\"\n\t\t\t\t\t\taria-labelledby=\"c-accordion-item-head-0\"\n\t\t\t\t\t>\n\t\t\t\t\t\t<div class=\"c-text default\">\n\t\t<p><span style=\"font-weight: 400;\">A real-time conversational AI agent with a digital avatar\u2014powered by an LLM and streamed live so users can talk to it face-to-face.<\/span><\/p>\n\n\t<\/div>\n\n\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/li>\n\t\t\t\t\t\t\t<li class=\"c-accordion__item\"\n\t\t\t\t\tid=\"c-accordion__item-1\"\n\t\t\t\t\tdata-id=\"c-accordion__item-1\"\n\t\t\t\t>\n\t\t\t\t\t\n\t\t\t\t\t<h3 class=\"c-el c-title-button c-accordion__item-head default\">\n\t<button \n\t\t\t\t\t\t\tid=\"c-accordion-item-head-1\"\n\t\t\t\t\t\t\taria-controls=\"c-accordion-item-panel-1\"\n\t\t\t\t\t\t\taria-expanded=\"false\"\n\t\t\t\t\t\t>\n\t\t<b>How is this different from V4 Expressive Avatars?<\/b><b>\n<\/b>\n\t\t<svg width=\"20\" height=\"21\" viewBox=\"0 0 20 21\" fill=\"none\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" aria-hidden=\"true\" focusable=\"false\" role=\"presentation\">\n\t\t\t\t\t\t\t<line x1=\"20\" y1=\"10.5\" x2=\"-8.74228e-08\" y2=\"10.5\" stroke=\"#090604\" stroke-width=\"2\"\/>\n\t\t\t\t\t\t\t<line x1=\"10\" y1=\"20.5\" x2=\"10\" y2=\"0.5\" stroke=\"#090604\" stroke-width=\"2\"\/>\n\t\t\t\t\t\t<\/svg>\n\t<\/button>\n<\/h3>\n\n\t\t\t\t\t<div\n\t\t\t\t\t\tid=\"c-accordion-item-panel-1\"\n\t\t\t\t\t\tclass=\"c-accordion__item-body\"\n\t\t\t\t\t\trole=\"region\"\n\t\t\t\t\t\taria-labelledby=\"c-accordion-item-head-1\"\n\t\t\t\t\t>\n\t\t\t\t\t\t<div class=\"c-text default\">\n\t\t<p><span style=\"font-weight: 400;\">Expressive avatars are optimized for generating videos. Expressive Visual Agents use the avatar in a <\/span><b>two-way, real-time session<\/b><span style=\"font-weight: 400;\">\u2014so the user can ask questions and get responses live.<\/span><\/p>\n\n\t<\/div>\n\n\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/li>\n\t\t\t\t\t\t\t<li class=\"c-accordion__item\"\n\t\t\t\t\tid=\"c-accordion__item-2\"\n\t\t\t\t\tdata-id=\"c-accordion__item-2\"\n\t\t\t\t>\n\t\t\t\t\t\n\t\t\t\t\t<h3 class=\"c-el c-title-button c-accordion__item-head default\">\n\t<button \n\t\t\t\t\t\t\tid=\"c-accordion-item-head-2\"\n\t\t\t\t\t\t\taria-controls=\"c-accordion-item-panel-2\"\n\t\t\t\t\t\t\taria-expanded=\"false\"\n\t\t\t\t\t\t>\n\t\t<b>What makes it \u201creal time\u201d?<\/b>\n\t\t<svg width=\"20\" height=\"21\" viewBox=\"0 0 20 21\" fill=\"none\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" aria-hidden=\"true\" focusable=\"false\" role=\"presentation\">\n\t\t\t\t\t\t\t<line x1=\"20\" y1=\"10.5\" x2=\"-8.74228e-08\" y2=\"10.5\" stroke=\"#090604\" stroke-width=\"2\"\/>\n\t\t\t\t\t\t\t<line x1=\"10\" y1=\"20.5\" x2=\"10\" y2=\"0.5\" stroke=\"#090604\" stroke-width=\"2\"\/>\n\t\t\t\t\t\t<\/svg>\n\t<\/button>\n<\/h3>\n\n\t\t\t\t\t<div\n\t\t\t\t\t\tid=\"c-accordion-item-panel-2\"\n\t\t\t\t\t\tclass=\"c-accordion__item-body\"\n\t\t\t\t\t\trole=\"region\"\n\t\t\t\t\t\taria-labelledby=\"c-accordion-item-head-2\"\n\t\t\t\t\t>\n\t\t\t\t\t\t<div class=\"c-text default\">\n\t\t<p><span style=\"font-weight: 400;\">The agent runs as a live session streamed via WebRTC using the Client SDK, enabling conversational turn-taking and immediate on-screen responses.<\/span><\/p>\n\n\t<\/div>\n\n\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/li>\n\t\t\t\t\t\t\t<li class=\"c-accordion__item\"\n\t\t\t\t\tid=\"c-accordion__item-3\"\n\t\t\t\t\tdata-id=\"c-accordion__item-3\"\n\t\t\t\t>\n\t\t\t\t\t\n\t\t\t\t\t<h3 class=\"c-el c-title-button c-accordion__item-head default\">\n\t<button \n\t\t\t\t\t\t\tid=\"c-accordion-item-head-3\"\n\t\t\t\t\t\t\taria-controls=\"c-accordion-item-panel-3\"\n\t\t\t\t\t\t\taria-expanded=\"false\"\n\t\t\t\t\t\t>\n\t\t<b>Can I use my preferred LLM or provider?<\/b>\n\t\t<svg width=\"20\" height=\"21\" viewBox=\"0 0 20 21\" fill=\"none\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" aria-hidden=\"true\" focusable=\"false\" role=\"presentation\">\n\t\t\t\t\t\t\t<line x1=\"20\" y1=\"10.5\" x2=\"-8.74228e-08\" y2=\"10.5\" stroke=\"#090604\" stroke-width=\"2\"\/>\n\t\t\t\t\t\t\t<line x1=\"10\" y1=\"20.5\" x2=\"10\" y2=\"0.5\" stroke=\"#090604\" stroke-width=\"2\"\/>\n\t\t\t\t\t\t<\/svg>\n\t<\/button>\n<\/h3>\n\n\t\t\t\t\t<div\n\t\t\t\t\t\tid=\"c-accordion-item-panel-3\"\n\t\t\t\t\t\tclass=\"c-accordion__item-body\"\n\t\t\t\t\t\trole=\"region\"\n\t\t\t\t\t\taria-labelledby=\"c-accordion-item-head-3\"\n\t\t\t\t\t>\n\t\t\t\t\t\t<div class=\"c-text default\">\n\t\t<p><span style=\"font-weight: 400;\">Yes. D\u2011ID supports built-in models, external provider keys, and custom LLM integrations via an OpenAI-compatible endpoint.<\/span><\/p>\n\n\t<\/div>\n\n\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/li>\n\t\t\t\t\t\t\t<li class=\"c-accordion__item\"\n\t\t\t\t\tid=\"c-accordion__item-4\"\n\t\t\t\t\tdata-id=\"c-accordion__item-4\"\n\t\t\t\t>\n\t\t\t\t\t\n\t\t\t\t\t<h3 class=\"c-el c-title-button c-accordion__item-head default\">\n\t<button \n\t\t\t\t\t\t\tid=\"c-accordion-item-head-4\"\n\t\t\t\t\t\t\taria-controls=\"c-accordion-item-panel-4\"\n\t\t\t\t\t\t\taria-expanded=\"false\"\n\t\t\t\t\t\t>\n\t\t<b>Can the agent answer based on my company documents?<\/b><b>\n<\/b>\n\t\t<svg width=\"20\" height=\"21\" viewBox=\"0 0 20 21\" fill=\"none\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" aria-hidden=\"true\" focusable=\"false\" role=\"presentation\">\n\t\t\t\t\t\t\t<line x1=\"20\" y1=\"10.5\" x2=\"-8.74228e-08\" y2=\"10.5\" stroke=\"#090604\" stroke-width=\"2\"\/>\n\t\t\t\t\t\t\t<line x1=\"10\" y1=\"20.5\" x2=\"10\" y2=\"0.5\" stroke=\"#090604\" stroke-width=\"2\"\/>\n\t\t\t\t\t\t<\/svg>\n\t<\/button>\n<\/h3>\n\n\t\t\t\t\t<div\n\t\t\t\t\t\tid=\"c-accordion-item-panel-4\"\n\t\t\t\t\t\tclass=\"c-accordion__item-body\"\n\t\t\t\t\t\trole=\"region\"\n\t\t\t\t\t\taria-labelledby=\"c-accordion-item-head-4\"\n\t\t\t\t\t>\n\t\t\t\t\t\t<div class=\"c-text default\">\n\t\t<p><span style=\"font-weight: 400;\">Yes. You can create a knowledge base with RAG by uploading documents, then attach it to the agent.<\/span><\/p>\n\n\t<\/div>\n\n\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/li>\n\t\t\t\t\t\t\t<li class=\"c-accordion__item\"\n\t\t\t\t\tid=\"c-accordion__item-5\"\n\t\t\t\t\tdata-id=\"c-accordion__item-5\"\n\t\t\t\t>\n\t\t\t\t\t\n\t\t\t\t\t<h3 class=\"c-el c-title-button c-accordion__item-head default\">\n\t<button \n\t\t\t\t\t\t\tid=\"c-accordion-item-head-5\"\n\t\t\t\t\t\t\taria-controls=\"c-accordion-item-panel-5\"\n\t\t\t\t\t\t\taria-expanded=\"false\"\n\t\t\t\t\t\t>\n\t\t<b>How do I measure performance and improve the experience?<\/b><b>\n<\/b>\n\t\t<svg width=\"20\" height=\"21\" viewBox=\"0 0 20 21\" fill=\"none\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" aria-hidden=\"true\" focusable=\"false\" role=\"presentation\">\n\t\t\t\t\t\t\t<line x1=\"20\" y1=\"10.5\" x2=\"-8.74228e-08\" y2=\"10.5\" stroke=\"#090604\" stroke-width=\"2\"\/>\n\t\t\t\t\t\t\t<line x1=\"10\" y1=\"20.5\" x2=\"10\" y2=\"0.5\" stroke=\"#090604\" stroke-width=\"2\"\/>\n\t\t\t\t\t\t<\/svg>\n\t<\/button>\n<\/h3>\n\n\t\t\t\t\t<div\n\t\t\t\t\t\tid=\"c-accordion-item-panel-5\"\n\t\t\t\t\t\tclass=\"c-accordion__item-body\"\n\t\t\t\t\t\trole=\"region\"\n\t\t\t\t\t\taria-labelledby=\"c-accordion-item-head-5\"\n\t\t\t\t\t>\n\t\t\t\t\t\t<div class=\"c-text default\">\n\t\t<p><span style=\"font-weight: 400;\">You can export conversations as a downloadable ZIP of JSON chat logs, suitable for analytics, QA, and iteration.<\/span><\/p>\n\n\t<\/div>\n\n\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/li>\n\t\t\t\t\t\t\t<li class=\"c-accordion__item\"\n\t\t\t\t\tid=\"c-accordion__item-6\"\n\t\t\t\t\tdata-id=\"c-accordion__item-6\"\n\t\t\t\t>\n\t\t\t\t\t\n\t\t\t\t\t<h3 class=\"c-el c-title-button c-accordion__item-head default\">\n\t<button \n\t\t\t\t\t\t\tid=\"c-accordion-item-head-6\"\n\t\t\t\t\t\t\taria-controls=\"c-accordion-item-panel-6\"\n\t\t\t\t\t\t\taria-expanded=\"false\"\n\t\t\t\t\t\t>\n\t\t<b>Is this built for prototypes or production?<\/b><b>\n<\/b>\n\t\t<svg width=\"20\" height=\"21\" viewBox=\"0 0 20 21\" fill=\"none\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" aria-hidden=\"true\" focusable=\"false\" role=\"presentation\">\n\t\t\t\t\t\t\t<line x1=\"20\" y1=\"10.5\" x2=\"-8.74228e-08\" y2=\"10.5\" stroke=\"#090604\" stroke-width=\"2\"\/>\n\t\t\t\t\t\t\t<line x1=\"10\" y1=\"20.5\" x2=\"10\" y2=\"0.5\" stroke=\"#090604\" stroke-width=\"2\"\/>\n\t\t\t\t\t\t<\/svg>\n\t<\/button>\n<\/h3>\n\n\t\t\t\t\t<div\n\t\t\t\t\t\tid=\"c-accordion-item-panel-6\"\n\t\t\t\t\t\tclass=\"c-accordion__item-body\"\n\t\t\t\t\t\trole=\"region\"\n\t\t\t\t\t\taria-labelledby=\"c-accordion-item-head-6\"\n\t\t\t\t\t>\n\t\t\t\t\t\t<div class=\"c-text default\">\n\t\t<p><span style=\"font-weight: 400;\">The platform is built around a deployable real-time stack: agent definition, session streaming, optional RAG, configurable LLMs, and exportable logs.<\/span><\/p>\n\n\t<\/div>\n\n\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/li>\n\t\t\t\t\t\t\t<li class=\"c-accordion__item\"\n\t\t\t\t\tid=\"c-accordion__item-7\"\n\t\t\t\t\tdata-id=\"c-accordion__item-7\"\n\t\t\t\t>\n\t\t\t\t\t\n\t\t\t\t\t<h3 class=\"c-el c-title-button c-accordion__item-head default\">\n\t<button \n\t\t\t\t\t\t\tid=\"c-accordion-item-head-7\"\n\t\t\t\t\t\t\taria-controls=\"c-accordion-item-panel-7\"\n\t\t\t\t\t\t\taria-expanded=\"false\"\n\t\t\t\t\t\t>\n\t\t<b>How do I get started?<\/b><b>\n<\/b>\n\t\t<svg width=\"20\" height=\"21\" viewBox=\"0 0 20 21\" fill=\"none\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" aria-hidden=\"true\" focusable=\"false\" role=\"presentation\">\n\t\t\t\t\t\t\t<line x1=\"20\" y1=\"10.5\" x2=\"-8.74228e-08\" y2=\"10.5\" stroke=\"#090604\" stroke-width=\"2\"\/>\n\t\t\t\t\t\t\t<line x1=\"10\" y1=\"20.5\" x2=\"10\" y2=\"0.5\" stroke=\"#090604\" stroke-width=\"2\"\/>\n\t\t\t\t\t\t<\/svg>\n\t<\/button>\n<\/h3>\n\n\t\t\t\t\t<div\n\t\t\t\t\t\tid=\"c-accordion-item-panel-7\"\n\t\t\t\t\t\tclass=\"c-accordion__item-body\"\n\t\t\t\t\t\trole=\"region\"\n\t\t\t\t\t\taria-labelledby=\"c-accordion-item-head-7\"\n\t\t\t\t\t>\n\t\t\t\t\t\t<div class=\"c-text default\">\n\t\t<p><span style=\"font-weight: 400;\">Start by creating an agent (avatar + voice + instructions), then run a real-time session through the Client SDK.<\/span><\/p>\n<p>&nbsp;<\/p>\n\n\t<\/div>\n\n\t\t\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/li>\n\t\t\t\t\t<\/ul>\n\t<\/div>\n\t\t\t<\/div>\n\t\t<\/div>\n\t\t<\/div>\n\t<\/div>\n<\/section>\n","protected":false},"excerpt":{"rendered":"<p>Real-time, emotionally intelligent conversations. Built for product-grade scale. Key Takeaways Digital humans have already proven their value in business communication: faster content production, consistent messaging, scalable localization, and always-on presence. But the moment you move from \u201cpresenting\u201d to \u201cconversing,\u201d the bar changes. Users don\u2019t just watch. They interrupt. They ask follow-ups. They challenge assumptions. They&#8230;<\/p>\n","protected":false},"author":93,"featured_media":13600,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":true,"content-type":"","_uag_custom_page_level_css":"","footnotes":""},"categories":[129,187],"tags":[68,253,251,250,209,252],"class_list":["post-13555","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-express-avatars","category-new-features","tag-generative-ai","tag-ai-avatar","tag-expressiveavatars","tag-newfeature","tag-news","tag-visualagent"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v27.4 (Yoast SEO v27.5) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>Introducing V4 Expressive Visual Agents | D-ID<\/title>\n<meta name=\"description\" content=\"Explore D-ID&#039;s blog post about Introducing V4 Expressive Visual Agents and more cutting-edge AI-driven technologies.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.d-id.com\/blog\/v4-expressive-visual-agents\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Introducing V4 Expressive Visual Agents\" \/>\n<meta property=\"og:description\" content=\"Explore D-ID&#039;s blog post about Introducing V4 Expressive Visual Agents and more cutting-edge AI-driven technologies.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.d-id.com\/blog\/v4-expressive-visual-agents\/\" \/>\n<meta property=\"og:site_name\" content=\"D-ID\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/deidentification\/\" \/>\n<meta property=\"article:published_time\" content=\"2026-03-16T14:59:32+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-03-16T14:59:37+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-14.png\" \/>\n\t<meta property=\"og:image:width\" content=\"800\" \/>\n\t<meta property=\"og:image:height\" content=\"444\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Tim Moss\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@D_ID_\" \/>\n<meta name=\"twitter:site\" content=\"@D_ID_\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Tim Moss\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.d-id.com\\\/blog\\\/v4-expressive-visual-agents\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.d-id.com\\\/blog\\\/v4-expressive-visual-agents\\\/\"},\"author\":{\"name\":\"Tim Moss\",\"@id\":\"https:\\\/\\\/www.d-id.com\\\/#\\\/schema\\\/person\\\/a81edf85d82aff6766ae8660228703a2\"},\"headline\":\"Introducing V4 Expressive Visual Agents\",\"datePublished\":\"2026-03-16T14:59:32+00:00\",\"dateModified\":\"2026-03-16T14:59:37+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.d-id.com\\\/blog\\\/v4-expressive-visual-agents\\\/\"},\"wordCount\":1198,\"publisher\":{\"@id\":\"https:\\\/\\\/www.d-id.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/www.d-id.com\\\/blog\\\/v4-expressive-visual-agents\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.d-id.com\\\/wp-content\\\/uploads\\\/2026\\\/03\\\/image-14.png\",\"keywords\":[\"#GenerativeAi\",\"ai avatar\",\"expressiveavatars\",\"newfeature\",\"news\",\"visualagent\"],\"articleSection\":[\"Express Avatars\",\"New features\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.d-id.com\\\/blog\\\/v4-expressive-visual-agents\\\/\",\"url\":\"https:\\\/\\\/www.d-id.com\\\/blog\\\/v4-expressive-visual-agents\\\/\",\"name\":\"Introducing V4 Expressive Visual Agents | D-ID\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.d-id.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.d-id.com\\\/blog\\\/v4-expressive-visual-agents\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.d-id.com\\\/blog\\\/v4-expressive-visual-agents\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.d-id.com\\\/wp-content\\\/uploads\\\/2026\\\/03\\\/image-14.png\",\"datePublished\":\"2026-03-16T14:59:32+00:00\",\"dateModified\":\"2026-03-16T14:59:37+00:00\",\"description\":\"Explore D-ID's blog post about Introducing V4 Expressive Visual Agents and more cutting-edge AI-driven technologies.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.d-id.com\\\/blog\\\/v4-expressive-visual-agents\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.d-id.com\\\/blog\\\/v4-expressive-visual-agents\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.d-id.com\\\/blog\\\/v4-expressive-visual-agents\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.d-id.com\\\/wp-content\\\/uploads\\\/2026\\\/03\\\/image-14.png\",\"contentUrl\":\"https:\\\/\\\/www.d-id.com\\\/wp-content\\\/uploads\\\/2026\\\/03\\\/image-14.png\",\"width\":800,\"height\":444},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.d-id.com\\\/blog\\\/v4-expressive-visual-agents\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.d-id.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Introducing V4 Expressive Visual Agents\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.d-id.com\\\/#website\",\"url\":\"https:\\\/\\\/www.d-id.com\\\/\",\"name\":\"D-ID\",\"description\":\"Create AI Videos, Interactive Avatars to engage your audience. Custom AI-powered digital people at scale for businesses and creators.\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.d-id.com\\\/#organization\"},\"alternateName\":\"Interfaces, Evolved.\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.d-id.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.d-id.com\\\/#organization\",\"name\":\"D-ID\",\"url\":\"https:\\\/\\\/www.d-id.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.d-id.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/www.d-id.com\\\/wp-content\\\/uploads\\\/2023\\\/11\\\/d-id-logo-1.svg\",\"contentUrl\":\"https:\\\/\\\/www.d-id.com\\\/wp-content\\\/uploads\\\/2023\\\/11\\\/d-id-logo-1.svg\",\"width\":66,\"height\":53,\"caption\":\"D-ID\"},\"image\":{\"@id\":\"https:\\\/\\\/www.d-id.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/www.facebook.com\\\/deidentification\\\/\",\"https:\\\/\\\/x.com\\\/D_ID_\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.d-id.com\\\/#\\\/schema\\\/person\\\/a81edf85d82aff6766ae8660228703a2\",\"name\":\"Tim Moss\",\"url\":\"https:\\\/\\\/www.d-id.com\\\/author\\\/tim-moss\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Introducing V4 Expressive Visual Agents | D-ID","description":"Explore D-ID's blog post about Introducing V4 Expressive Visual Agents and more cutting-edge AI-driven technologies.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.d-id.com\/blog\/v4-expressive-visual-agents\/","og_locale":"en_US","og_type":"article","og_title":"Introducing V4 Expressive Visual Agents","og_description":"Explore D-ID's blog post about Introducing V4 Expressive Visual Agents and more cutting-edge AI-driven technologies.","og_url":"https:\/\/www.d-id.com\/blog\/v4-expressive-visual-agents\/","og_site_name":"D-ID","article_publisher":"https:\/\/www.facebook.com\/deidentification\/","article_published_time":"2026-03-16T14:59:32+00:00","article_modified_time":"2026-03-16T14:59:37+00:00","og_image":[{"width":800,"height":444,"url":"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-14.png","type":"image\/png"}],"author":"Tim Moss","twitter_card":"summary_large_image","twitter_creator":"@D_ID_","twitter_site":"@D_ID_","twitter_misc":{"Written by":"Tim Moss","Est. reading time":"7 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.d-id.com\/blog\/v4-expressive-visual-agents\/#article","isPartOf":{"@id":"https:\/\/www.d-id.com\/blog\/v4-expressive-visual-agents\/"},"author":{"name":"Tim Moss","@id":"https:\/\/www.d-id.com\/#\/schema\/person\/a81edf85d82aff6766ae8660228703a2"},"headline":"Introducing V4 Expressive Visual Agents","datePublished":"2026-03-16T14:59:32+00:00","dateModified":"2026-03-16T14:59:37+00:00","mainEntityOfPage":{"@id":"https:\/\/www.d-id.com\/blog\/v4-expressive-visual-agents\/"},"wordCount":1198,"publisher":{"@id":"https:\/\/www.d-id.com\/#organization"},"image":{"@id":"https:\/\/www.d-id.com\/blog\/v4-expressive-visual-agents\/#primaryimage"},"thumbnailUrl":"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-14.png","keywords":["#GenerativeAi","ai avatar","expressiveavatars","newfeature","news","visualagent"],"articleSection":["Express Avatars","New features"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.d-id.com\/blog\/v4-expressive-visual-agents\/","url":"https:\/\/www.d-id.com\/blog\/v4-expressive-visual-agents\/","name":"Introducing V4 Expressive Visual Agents | D-ID","isPartOf":{"@id":"https:\/\/www.d-id.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.d-id.com\/blog\/v4-expressive-visual-agents\/#primaryimage"},"image":{"@id":"https:\/\/www.d-id.com\/blog\/v4-expressive-visual-agents\/#primaryimage"},"thumbnailUrl":"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-14.png","datePublished":"2026-03-16T14:59:32+00:00","dateModified":"2026-03-16T14:59:37+00:00","description":"Explore D-ID's blog post about Introducing V4 Expressive Visual Agents and more cutting-edge AI-driven technologies.","breadcrumb":{"@id":"https:\/\/www.d-id.com\/blog\/v4-expressive-visual-agents\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.d-id.com\/blog\/v4-expressive-visual-agents\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.d-id.com\/blog\/v4-expressive-visual-agents\/#primaryimage","url":"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-14.png","contentUrl":"https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-14.png","width":800,"height":444},{"@type":"BreadcrumbList","@id":"https:\/\/www.d-id.com\/blog\/v4-expressive-visual-agents\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.d-id.com\/"},{"@type":"ListItem","position":2,"name":"Introducing V4 Expressive Visual Agents"}]},{"@type":"WebSite","@id":"https:\/\/www.d-id.com\/#website","url":"https:\/\/www.d-id.com\/","name":"D-ID","description":"Create AI Videos, Interactive Avatars to engage your audience. Custom AI-powered digital people at scale for businesses and creators.","publisher":{"@id":"https:\/\/www.d-id.com\/#organization"},"alternateName":"Interfaces, Evolved.","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.d-id.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.d-id.com\/#organization","name":"D-ID","url":"https:\/\/www.d-id.com\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.d-id.com\/#\/schema\/logo\/image\/","url":"https:\/\/www.d-id.com\/wp-content\/uploads\/2023\/11\/d-id-logo-1.svg","contentUrl":"https:\/\/www.d-id.com\/wp-content\/uploads\/2023\/11\/d-id-logo-1.svg","width":66,"height":53,"caption":"D-ID"},"image":{"@id":"https:\/\/www.d-id.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/deidentification\/","https:\/\/x.com\/D_ID_"]},{"@type":"Person","@id":"https:\/\/www.d-id.com\/#\/schema\/person\/a81edf85d82aff6766ae8660228703a2","name":"Tim Moss","url":"https:\/\/www.d-id.com\/author\/tim-moss\/"}]}},"uagb_featured_image_src":{"full":["https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-14.png",800,444,false],"thumbnail":["https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-14-150x150.png",150,150,true],"medium":["https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-14-300x167.png",300,167,true],"medium_large":["https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-14-768x426.png",768,426,true],"large":["https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-14.png",800,444,false],"1536x1536":["https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-14.png",800,444,false],"2048x2048":["https:\/\/www.d-id.com\/wp-content\/uploads\/2026\/03\/image-14.png",800,444,false]},"uagb_author_info":{"display_name":"Tim Moss","author_link":"https:\/\/www.d-id.com\/author\/tim-moss\/"},"uagb_comment_info":0,"uagb_excerpt":"Real-time, emotionally intelligent conversations. Built for product-grade scale. Key Takeaways Digital humans have already proven their value in business communication: faster content production, consistent messaging, scalable localization, and always-on presence. But the moment you move from \u201cpresenting\u201d to \u201cconversing,\u201d the bar changes. Users don\u2019t just watch. They interrupt. They ask follow-ups. They challenge assumptions. They...","_links":{"self":[{"href":"https:\/\/www.d-id.com\/wp-json\/wp\/v2\/posts\/13555","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.d-id.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.d-id.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.d-id.com\/wp-json\/wp\/v2\/users\/93"}],"replies":[{"embeddable":true,"href":"https:\/\/www.d-id.com\/wp-json\/wp\/v2\/comments?post=13555"}],"version-history":[{"count":0,"href":"https:\/\/www.d-id.com\/wp-json\/wp\/v2\/posts\/13555\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.d-id.com\/wp-json\/wp\/v2\/media\/13600"}],"wp:attachment":[{"href":"https:\/\/www.d-id.com\/wp-json\/wp\/v2\/media?parent=13555"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.d-id.com\/wp-json\/wp\/v2\/categories?post=13555"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.d-id.com\/wp-json\/wp\/v2\/tags?post=13555"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}