(yes, one more old thread getting bumped by me...)
i think the extractor should stay under /contrib/ but needs a general update/rewrite (the code is *really* messy). i'd also like to see it being 64bit safe. using ACE for it might be a good idea. i'm not very good in threading, but it might also give a speedup if we'd use ACE to make the extractors (map + vmap) multithreaded (not sure whether IO or calculations use more power, but i'd say calculations, so multithreading might be worth it).
i already started a branch where i'm working on making projects under 64bit safe (and also doing other stuff on the projects^^): http://github.com/DasBlub/mangos/commits/x64
maybe i'll give it a shot later, when i'm done with the other projects. anyone is free to already do it