Multi-room audio is the feature that makes people’s houses feel like smart homes. Getting it right — where automations trigger music, voice commands work in every room, and everything synchronizes — requires thinking through the stack carefully. Get it wrong and you’ve got three apps, three protocols, and speakers that can’t talk to each other.
Here’s how I’d build it in 2026, with a Home Assistant-first perspective.
What “Good” Looks Like
Before diving into hardware, here’s what you’re actually trying to achieve:
- Music follows me around the house (or plays everywhere at once)
- Automations can start, stop, adjust volume, and switch sources — in any room, on any trigger
- TTS announcements work on every speaker (doorbell, weather, reminders)
- Guests can play music without installing an app or getting a lecture about protocols
- No cloud dependency for basic playback control
These requirements eliminate most of the consumer speaker market immediately. Most Bluetooth speakers don’t talk to HA. Most smart speakers from Amazon and Google work but are cloud-dependent — if Amazon’s servers are slow, your morning routine announcement delays. What’s left is a much smaller field.
The Architecture That Works
Multi-room audio in HA lives and dies by your streaming protocol choice. The three that actually work well:
AirPlay 2 — Apple’s protocol. Synchronized playback across zones, great iOS/macOS integration, low latency, works over LAN. HA supports AirPlay 2 receivers natively. Works on WiiM, Denon, Yamaha HEOS-capable receivers, and third-party software like forked-daapd.
Snapcast — Open source, LAN-only, sub-millisecond synchronization, works with any audio endpoint that can run a Snapcast client (Raspberry Pi, Linux boxes, old computers). Fiddly to set up, genuinely excellent once running. The audiophile home-server option.
Sonos — Proprietary but excellent. Solid HA integration, great app, consistently reliable synchronization. The consumer-friendly option.
For most people: WiiM over AirPlay 2 is the best balance of cost, quality, and HA integration. Here’s how I’d build each tier.
Tier 1: The WiiM-First Stack (~$800-1,500 total)
This is what I’d build if I was starting from scratch today. WiiM hardware, AirPlay 2 for synchronization, HA for automation.
Living Room / Main Zone:
WiiM Amp Ultra ($499) driving a pair of passive bookshelves. I pair mine with ELAC Debut 3.0 DB63s ($280). Total: ~$780 for the main zone and it sounds genuinely excellent. HDMI ARC means the TV goes through the same amp. RoomFit EQ makes the room sound better without any additional hardware.
Secondary Rooms: WiiM Amp (~$299) per room, each driving whatever passive speakers you have or a cheap set of bookshelves. The original WiiM Amp is 60W x2, supports AirPlay 2, Google Cast, Spotify Connect — it does everything you need at a price that makes it reasonable to put in multiple rooms.
Rooms Without Speakers: WiiM Pro Plus (~$149) as a streaming preamp feeding existing powered speakers or an existing amp. This is the option for rooms where you already have something that can amplify — the WiiM Pro Plus gives it streaming capabilities without replacing it.
HA Integration:
Every WiiM device shows up as a media_player entity. You can group them for synchronized playback, control volume independently, and trigger automations that play different content in different rooms simultaneously. I have an automation that plays jazz in the kitchen while news plays in the home office during my morning routine. Both start simultaneously, both can be controlled from the HA dashboard.
Total for 3-room WiiM setup: ~$1,100-1,400 depending on speaker choices.
Tier 2: Sonos for Simplicity (~$700-1,500)
If you want the simplest setup with the best app experience and you don’t mind the ongoing cloud dependency, Sonos is still the benchmark for consumer multi-room audio.
The Case for Sonos in 2026: After their disastrous app redesign in 2024, Sonos spent most of 2025 fixing it. The app as of early 2026 is back to being good, and the hardware remains excellent. The Era lineup is the best hardware they’ve shipped.
Sonos Era 100 (~$249) — Stereo pair of drivers in a small package. AirPlay 2 plus Sonos native protocol. Works as a standalone bedroom or office speaker. HA integration is mature.
Sonos Era 300 (~$449) — Spatial audio and Dolby Atmos support. Sounds excellent for music and is probably overkill as a smart home speaker, but if you want a living room speaker that also handles TV spatial audio, it’s compelling.
Why I’d still pick WiiM over Sonos for a new build: Sonos is more expensive per zone. The Era 100 at $249 gives you a single speaker; the WiiM Amp at $299 drives your own passive speakers and gives you more flexibility. If you already have passive speakers or want to upgrade them independently, WiiM wins. If you want a zero-thought speaker you can put on a shelf and forget about, Sonos is the easier answer.
HA + Sonos: The integration is good. You get media player entities for every Sonos device, grouping works, TTS works. The main limitation: Sonos sometimes requires cloud for initial setup even when LAN control is otherwise available. It’s not a dealbreaker, but it’s not as clean as WiiM’s local-first story.
Tier 3: The Apple Ecosystem Option
If your house runs on Apple — iPhone, iPad, HomePods already — there’s an argument for leaning in.
Apple HomePod mini (~$99) as secondary-room speakers. HomePod minis are AirPlay 2, they work as Siri endpoints, and they’re small enough to put anywhere. They don’t have great bass or volume for serious listening, but for TTS announcements, background music in a small room, or a kitchen counter speaker, they’re excellent value at $99.
The catch: HomePod minis in HA work via HomeKit, which means a slightly indirect integration path. They work — you can control them from HA as media player entities — but the integration isn’t as tight as WiiM’s native HA support. If you’re deep in the Apple ecosystem, that trade-off makes sense. If not, WiiM remains the better choice.
The Automation Layer
Hardware is only half of it. Here’s how I have my multi-room setup automated:
Morning routine (weekdays, 6:30 AM):
- Bedroom lights fade on slowly
- WiiM Amp Ultra (living room) and WiiM Amp (kitchen) join an AirPlay 2 group
- Both play a morning playlist at 20% volume
- A TTS announcement plays the weather forecast through the kitchen speaker at 6:45
Arriving home:
- Presence detection triggers when I’m within 5 minutes of home
- Living room WiiM Amp starts playing — resumes whatever was last playing, or starts a default playlist if nothing was queued
- Volume ramps from 10% to 35% over 60 seconds so it doesn’t blast me through the door
Movie time:
- HA detects my Apple TV is playing (via the Apple TV integration)
- All WiiM Amps in the house except the living room mute
- Living room lights dim to 40%
Late night:
- After 10 PM, any speaker that’s been idle for 30 minutes turns off
- The bedroom WiiM Amp volume is capped at 30% after 10 PM to prevent accidentally blasting the house
All of this runs locally. No cloud services in the loop for any of these automations. If my internet goes down, everything still works.
What to Avoid
Smart TV built-in speakers — no HA control, no integration, no synchronization.
Google Home / Nest Audio — works, but everything runs through Google’s cloud. HA integration exists but isn’t as clean, and you’re dependent on Google keeping their API stable (see: the Stadia story, the various deprecated Google products).
Cheap Bluetooth speakers — they don’t participate in multi-room setups, they don’t have HA integrations, and they require a phone to be the source. Fine for portable use, wrong for home automation.
Amazon Echo — works with HA through Alexa integration, but the same cloud dependency problem as Google. Also: Amazon has been unreliable about maintaining their HA integration. Use them as voice command endpoints if you already have them, but don’t build your audio infrastructure around them.
My Actual Recommendation
Start with the WiiM Amp Ultra in your main room, pair it with ELAC Debut 3.0s or whatever passive speakers you already own, and add WiiM Amps in secondary rooms as your budget allows. Get all of them into HA as media player entities, group them via the AirPlay 2 integration, and start building your automation stack.
The total cost for a three-room setup that sounds great and integrates deeply with Home Assistant is around $1,100-1,400. That’s less than a single Denon HEOS receiver with a soundbar and no passive speaker flexibility.
This is the part of the smart home space that’s genuinely gotten better. Three years ago, a fully local, HA-integrated, multi-room audio setup required either Sonos (expensive) or serious DIY with Snapcast and Raspberry Pis. Today WiiM makes it approachable, affordable, and excellent. Build it.
The Bottom Line
For HA-first multi-room audio in 2026: WiiM is the answer for most people. AirPlay 2 for synchronization, HA for automation, passive speakers for best sound-per-dollar. Start with the Amp Ultra in your main room, add Amps to secondary rooms, build your automations. If you want the zero-friction consumer option, Sonos Era hardware is excellent — just know you’re trading some flexibility and HA tightness for a better out-of-box experience. Either way, the infrastructure to have music everywhere, controlled by automations, with no cloud dependencies is available and affordable right now.