Podcast Episode File Size and Bitrate Estimator
Calculate multi-track WAV recording storage needs and final MP3/AAC export sizes against your SD card and podcast host limits.
The Complete Guide to Podcast File Sizes and Bitrate
File size is one of the most misunderstood variables in podcasting. Raw multi-track sessions can balloon to dozens of gigabytes while the same content exports to under 50 MB as an MP3. Understanding the math behind these numbers helps you choose the right SD card, stay within hosting limits, and avoid the panic of a failed upload on release day.
How Raw WAV File Size is Calculated
An uncompressed WAV file stores every sample your microphone captures. The formula is straightforward: Sample Rate multiplied by Bit Depth gives you the number of bits per second per track. Divide by 8 to get bytes, multiply by the number of seconds, then multiply by the number of tracks.
Example: a 2-track podcast recorded at 48 kHz and 24-bit for 60 minutes produces (48,000 x 24 / 8) x 3,600 x 2 = approximately 829 MB. That is why serious multi-track sessions with 4 or more guests routinely fill 32 GB or 64 GB cards during a single recording day.
How Compressed Export File Size is Calculated
MP3 and AAC files are encoded at a constant bitrate measured in kilobits per second. The formula collapses to: (Bitrate in kbps x 1000 x Duration in seconds) / 8 = bytes. A 60-minute mono episode at 96 kbps produces (96,000 x 3,600) / 8 = approximately 43 MB - well within every major hosting platform's limit.
Stereo doubles the effective data because the encoder must carry two channels of frequency content. This is why 128 kbps is the accepted minimum for stereo and 64 kbps is only usable for mono voice recordings where file size is the top priority.
How to Use This Tool
Switch to Raw Recording mode to plan your SD card capacity before a recording session. Enter the expected total duration, the number of microphone or instrument tracks your setup will capture, and choose your interface's sample rate and bit depth. The estimator tells you whether your card has enough room and flags any overflow.
Switch to Final Export mode to verify your compressed episode will pass your host's upload limit before you encode. Enter the episode length, choose mono or stereo, select your target bitrate, and pick your hosting platform from the dropdown.
Frequently Asked Questions
For spoken-word podcasts, mono is almost always the right choice. Human speech is a single-point sound source, meaning there is no meaningful spatial information lost when collapsing to mono. Exporting in stereo doubles the file size for zero audible benefit to the listener.
Mono at 128 kbps is perceptually identical to stereo at 256 kbps for voice content, and it plays perfectly on earbuds, car stereos, smart speakers, and phones. The only time stereo is worth it is if your podcast includes significant music beds, sound design, or ambient soundscapes where left-right panning adds to the experience.
96 kbps mono is the sweet spot for voice-only podcasts. It produces excellent speech clarity at a file size that fits comfortably within most hosting limits even for long episodes. 64 kbps mono is acceptable for very budget-conscious hosts with tight storage limits but can sound slightly hollow on some playback systems.
128 kbps mono is a safe choice if you include music intros, outros, or transitions. Anything above 128 kbps for mono voice audio is diminishing returns - your listeners will not hear the difference, but your storage and bandwidth costs go up.
WAV files are uncompressed. Every single sample captured by your audio interface is stored as a raw number. At 48 kHz, 24-bit, that is 48,000 samples per second times 3 bytes per sample times the number of tracks. A 4-track session at those settings produces about 82 MB per minute.
MP3 compression uses psychoacoustic modeling to discard frequencies the human ear is statistically unlikely to notice, then encodes what remains at a fraction of the original size. A 96 kbps mono MP3 uses only 0.72 MB per minute - over 100 times smaller than a multi-track WAV session at the same duration. This is why your DAW project might require a 64 GB SD card while the finished episode fits in 50 MB.
Yes. 32-bit float uses 4 bytes per sample compared to 3 bytes for 24-bit and 2 bytes for 16-bit, so a 32-bit float file is 33% larger than the equivalent 24-bit recording and 100% larger than 16-bit. The benefit of 32-bit float is headroom protection - it virtually eliminates clipping because the format can represent levels far above and below what any microphone can actually produce.
Recorders like the Zoom F3 and Sound Devices MixPre series use 32-bit float so the gain setting becomes irrelevant during capture. For most podcast setups with a controlled environment and proper gain structure, 24-bit is more than sufficient and saves meaningful storage space.
Upload limits vary by plan and platform. Libsyn Basic caps individual episode files at 50 MB, while Libsyn Pro raises this to 250 MB. Spotify for Podcasters (formerly Anchor) allows episode files up to 250 MB. Podbean enforces a 100 MB per-episode limit on most plans.
These are per-episode file size limits, not monthly storage quotas. A 60-minute episode at 96 kbps mono MP3 is approximately 43 MB - safely under all major limits. Problems arise with very long episodes, high-bitrate stereo exports, or when trying to upload raw WAV masters instead of compressed exports.
48 kHz is the standard for podcasting and video production. It is the default for broadcast, video platforms, and most professional audio interfaces. 44.1 kHz is the CD standard and is perfectly fine for audio-only podcasts distributed as MP3. The difference is inaudible to virtually all listeners.
96 kHz is a high-resolution recording format that nearly doubles your file sizes with no meaningful benefit for voice content. Human speech contains very little energy above 8 kHz, and even the best microphones rarely capture meaningful content above 20 kHz. Record at 48 kHz, export to MP3 at the appropriate bitrate, and spend your storage budget on more tracks or longer recordings instead.