The complete guide to Mbox files: exporting, viewing and archiving from Google Workspace

In modern business, email is the undisputed backbone of our communication. Yet a persistent paradox emerges: the more data we accumulate in Google Workspace, the more complex it becomes to export and access that data. Anyone trying to export their digital history via Google Takeout will quickly encounter enormous Mbox files that crash local computers, cause metadata to disappear or introduce privacy risks. This guide explains how the Mbox format works, how to correctly export your data from Google Workspace and how (using the right tools, completely privately) you can regain control over your email archives.


What is an Mbox file?

The Mbox format is the industry standard for email storage and has existed since the early days of Unix. In essence, an .mbox file is one large text file in which all email messages are stored consecutively. Each message begins with a separator line starting with From (followed by a space), the sender's email address and a timestamp.

info
The Mbox format is based on plain text, making it human-readable and universally applicable — but that same plain-text structure means it was not designed for ease of use at large volumes.

Because the format is based on plain text, it has a number of unique properties:

  • check_circleHuman-readable - you can theoretically open an Mbox file with any text editor.
  • check_circleUniversal - virtually every email client and archiving system understands the format.
  • check_circleSelf-contained - headers, message body and attachments (MIME-encoded) are all stored in a single file.
  • check_circleNo proprietary software required - unlike Microsoft's `.pst` format, which has a complex binary structure that is nearly impossible to open without Outlook.

That universality is precisely what makes Mbox the de facto standard for data export, legal archiving (eDiscovery) and migrations between email providers.

The four Mbox variants

The format has four variants, each with its own method for marking message boundaries. This is relevant if you want to technically understand why certain files cause problems with specific software.

VariantSeparation mechanismCharacteristic
mboxoFixed string (e.g. ???@???)Risk of irreversible corruption if From appears in the message body
mboxrdLines with From in text become >FromFully reversible; used by Thunderbird
mboxclContent-Length header defines message sizeBased on Unix System V
mboxcl2Refinement of mboxclMinor changes in header processing
lightbulb
Google Takeout generates files in the mboxrd variant, the most robust and widely supported choice. You do not need to configure anything for this yourself.

Why Mbox files cause headaches

The Mbox format was not designed for the usage volumes of today. A mailbox spanning ten years with attachments can easily grow to 10 or 20 GB, all in a single file. The moment you try to open or process that file, you run into a series of problems.

System crashes due to RAM exhaustion

Traditional email clients such as Microsoft Outlook do not support the Mbox format at all. Mozilla Thunderbird does support it, but when opening a file it first tries to create an index file (.msf). With a file of 40 GB or more, this process can take hours and completely exhaust your computer's RAM, resulting in freezing or a crash.

Text editors load everything at once

Because Mbox is plain text, users sometimes try to open the file in Notepad or similar editors. But these programs load the entire file into memory at once. Above a few gigabytes, this almost always results in an "out of memory" error or an immediate crash.

warning
Never use a regular text editor to open an Mbox file. With files larger than 2 GB, this will almost certainly lead to a system crash or file corruption.

The four most common problems at a glance

  • check_circleSystem crashes due to RAM exhaustion with files larger than 10 GB.
  • check_circleLoss of metadata such as send dates and folder structure when converting to PST or PDF.
  • check_circlePrivacy risks when uploading sensitive data to unknown online servers.
  • check_circleFile corruption caused by antivirus software modifying the From markers during download.

The privacy pitfall of online converters

Faced with these problems, many users turn to online Mbox converters. This carries a serious risk.

warning
With traditional online viewers, you are uploading your complete email history to an external server. You have no control over what happens to that data, who has access to it or how long it is retained. Under the GDPR, this is in many cases a direct violation of a company's own policy.

An Mbox archive spanning ten years contains password resets, bank statements, medical information and confidential business correspondence. The only safe method is local processing without any upload.


Exporting your Google Workspace data: the right approach

Before you can view your archive, a correct export from Google Workspace is essential. There are two routes: Google Takeout for individual users and the Admin Data Export for organisations.

Takeout vs. Admin Export: which suits you?

devicesGoogle Takeout (individual)
For end users and personal accounts. Accessible with your own login credentials. Exports emails to a download link or Google Drive. No limits on export frequency, but throttled at large volumes.
groupAdmin Data Export (organisation)
For IT administrators. Requires super-administrator rights. Exports the entire organisation including Vault data and suspended accounts. Available only once every 30 days.

Step-by-step: exporting via Google Takeout

1
Step 1: Clean up before the export
Delete emails with extremely large attachments (larger than 10 MB) that you no longer need. Also check that your storage is not nearly full. Google sometimes refuses to start an export when an account is completely blocked.
2
Step 2: Configure Takeout
Go to takeout.google.com and deselect all services except Mail. Click "All mail data included" to choose which labels you want to export. When in doubt, always select "All mail" to prevent archived messages without a label from being skipped.
3
Step 3: Optimise export settings
Choose file type .zip. Choose archive size 20 GB so you have fewer separate files to manage. If you have an unstable connection, choose 2 GB to prevent a failed download from using up one of your five download attempts.
4
Step 4: Wait and download
You will receive an email from Google once the archive is ready. This can take anywhere from a few hours to several days. Download the files via the links you receive and unpack them. The Mbox files are typically located in the folder Takeout/Mail/.
5
Step 5: Open safely with the Mbox Viewer
Drag the Mbox file into the Cloud Captains Mbox Viewer. Within seconds you have full insight into your archive.
warning
Google limits the number of download attempts per generated archive to five. With a large export, any connection error can cost a valuable attempt. After five failed attempts you must restart the entire export process, which can again take several days.

Exporting as an administrator via the Admin Console

Are you an administrator for an organisation? Use the built-in Data Export tool.

Navigate in the Google Admin Console via

Accountarrow_forward_iosData exportarrow_forward_iosData Export

for an organisation-wide export. The files will be delivered to a Google Cloud Storage bucket that you can then download.


Common export problems and how to solve them

"My Mbox file appears to be empty"

This typically has one of the following causes:

  • check_circleSynchronisation error: The index of your mailbox is out of sync with the actual storage. This occurs with accounts that have been accessed via IMAP or POP3 for an extended period. Solution: wait a day and export again.
  • check_circleLabel problem: Emails without a specific label are skipped in a label-specific export. Always export with the "All mail" option selected.
  • check_circleAntivirus corruption: Temporarily disable real-time file scanning while downloading the Mbox file.

"My export is missing thousands of emails"

info
Google throttles the export for large accounts. The export process can take up to 72 hours.

"I have hundreds of zip files"

With a large mailbox, Google splits the data across multiple archives. The metadata of a message may be in one archive while the actual content is in another. The Cloud Captains Mbox Viewer supports loading multiple files simultaneously, giving you a complete overview without having to merge them manually.


Mbox Viewer by Cloud Captains

Our Mbox Viewer was developed to completely eliminate the privacy pitfalls of online converters.

How it works: WebAssembly in the browser

Instead of sending your file to a server, the Mbox Viewer uses WebAssembly (Wasm) a technology that runs code directly in your browser at near-native speed. Your file is processed locally on your own computer. Not a single byte travels over the internet to our servers.

check_circle
You can even disconnect your internet connection after the page has loaded to be 100% certain that your data never leaves your system. Fully GDPR-compliant, even for the most sensitive business archives.

Everything you can do with the Mbox Viewer

shield100% Local & Private
No data upload. Your sensitive business archives never leave your own system. Processing via WebAssembly directly in the browser.
speedBlazing fast processing
Files of 5 GB or more are handled effortlessly without crashes or delays in any modern browser.
searchAdvanced search
Search instantly by keyword, sender, recipient or date. Essential for eDiscovery and legal audits.
auto_awesomeNo installation required
Drag-and-drop interface. No technical knowledge needed. Works directly in Chrome via the extension in the Web Store.

Comparison: solutions side by side

FeatureThunderbirdOnline convertersMbox Viewer (Cloud Captains)
Installation requiredYes, complexNoNo
Data privacyLocal but slowUpload to external server100% local via WebAssembly
Files 50 GB+Crashes or very slowLimited by upload speedBlazing fast local processing
Ease of useTechnical knowledge requiredEasy but riskyDrag-and-drop, no skills needed
View attachmentsYesLimitedFully supported
GDPR-compliantYesQuestionableYes

When is an Mbox archive indispensable for business?

In legal disputes, an organisation must be able to produce relevant communications quickly. An Mbox archive containing all original headers and metadata is a powerful piece of evidence.

info
With the search function of the Mbox Viewer, legal teams can quickly identify relevant emails and save them as EML files for further processing without compromising the integrity of the original archive.

Knowledge retention when staff leave

When a key employee leaves the company, valuable context is often lost. Many organisations export the departing employee's mailbox to Mbox and then remove the licence in Google Workspace to save costs.

Compliance and GDPR archiving

Companies in certain sectors (finance, healthcare, legal) are legally required to retain communications for several years. A locally stored Mbox archive that is searchable with the right tools meets this retention obligation without the high costs of cloud-based archiving solutions.

Migration to a different email provider

If your organisation is switching from Google Workspace to Microsoft 365 (or vice versa), Mbox is the universal intermediate format. A correct export and import via Mbox ensures that all emails, including original timestamps and folder structure, are transferred completely.


Conclusion: regaining control of your digital legacy

The complexity of Mbox files and the friction involved in exporting Google Workspace data remind us that in the cloud we are often merely guests in our own information. The challenges (from download limits in Takeout to the impossibility of opening large files locally) are symptomatic of an ecosystem that makes leaving the platform as difficult as possible.