all repos — h3rald @ 2d969c84c8af5d78d1533418dbcf62d26b57b3f7

The sources of https://h3rald.com

contents/grimoire/split-mbox-file.md

 1
 2
 3
 4
 5
 6
 7
 8
 9
 10
 11
 12
 13
 14
 15
 16
 17
 18
 19
 20
 21
 22
 23
 24
 25
 26
 27
 28
 29
-----
id: split-mbox-file
title: "Split a .mbox file into chunks"
subtitle: "Apparently, Apple Mail insists on copying the entire .mbox file contents into RAM when importing..."
content-type: spell 
-----

The MBOX format is essentially a textual format for exporting emails as single files. As a result, if you try to export several years of emails, these files can get big pretty quickly.

While this may not be a problem, some email clients such as Apple Mail insist on doing stupid things like loading (very inefficiently, too) the entire contents of the file into RAM when importing it, which will likely cause your MacBook to scream that you are out of application memory.

To avoid this, you can use this nifty script to tidily split your .mbox file into chunks of 1GB each (adjust as needed):

```awk
BEGIN{chunk=0;filesize=0;}
    /^From /{
    if(filesize>=1000000000){#file size per chunk in byte
        close("chunk_" chunk ".txt");
        filesize=0;
        chunk++;
    }
  }
  {filesize+=length()}
  {print > ("chunk_" chunk ".txt")}
```

Credits: [StackOverflow](https://stackoverflow.com/questions/28110536/how-to-split-an-mbox-file-into-n-mb-big-chunks-using-the-terminal)