Ruben Laguna’s blog

Lucene 2.9.0/3.0.1 Memory Leaks

If you are experiencing high memory comsumption with Lucene 2.9.0/3.0.1 it could be due…

  1. … to a recently reported and fixed bug in StandardTokenizer where JFlex generated code was expanding a buffer (zzBuffer) and never trimming it down
    Eclipse Memory Analyzer
    Uploaded with plasq’s Skitch!

  2. … to another recently reported and fixed bug where IndexWriter held references to Readers used in your Fields, (and if you have apache tika’s reader, those can take up a lot of space)
    Eclipse Memory Analyzer-1
    Uploaded with plasq’s Skitch!