As I said earlier in the thread, I think you could really benefit from some quality reverbs. They would enhance the spatial image (different sounds sit in different locations of the virtual space, and so get different reverb tails). Reverbs also tend to push things into the background and wash them out a bit, which should hopefully mush/glue sounds together when they occur in quick, repetitive succession.
I just had another idea, which you'd have to try to see whether it works and what it does. The exact millisecond at which an event occurs is not that important; I think you can safely shift an event by 20-50 ms without losing any accuracy that matters. You could use this leeway to space similar events slightly apart, so they produce less of that glitchy "retrigger" effect. It would take some code, but you could even buffer all events for, say, 200 ms and use that sliding window to space them out as evenly as possible. I don't know whether it would improve things, because it's really not what happens with physical ambient sounds, but it seems to me it would smooth things out a bit more. Give it a try.
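
Here's a rough sketch of the spacing idea in Python, just to make it concrete. Everything in it is my own assumption, not something from your code: the function name `spread_events`, times as plain millisecond numbers, and fixed 200 ms chunks instead of a true sliding window (simpler to show). Within each chunk it aims for even spacing between the first and last event, but never moves any event by more than 50 ms.

```python
def spread_events(times_ms, window_ms=200, max_shift_ms=50):
    """Even out event times (in ms) within each window, limiting the shift.

    Hypothetical helper for illustration only. Events are grouped into
    consecutive windows of `window_ms`; inside a group, each event is
    nudged toward an evenly spaced slot, by at most `max_shift_ms`.
    """
    times = sorted(times_ms)
    out = []
    i = 0
    while i < len(times):
        # Collect all events within window_ms of the first event in the group.
        j = i
        while j < len(times) and times[j] - times[i] < window_ms:
            j += 1
        group = times[i:j]
        n = len(group)
        if n < 3:
            # One or two events: the endpoints stay put, nothing to even out.
            out.extend(group)
        else:
            first, last = group[0], group[-1]
            step = (last - first) / (n - 1)  # ideal gap between events
            for k, t in enumerate(group):
                target = first + k * step
                # Move toward the ideal slot, but no further than max_shift_ms.
                shift = max(-max_shift_ms, min(max_shift_ms, target - t))
                out.append(t + shift)
        i = j
    return out


# Three events bunched at the start of the window, one near the end.
print(spread_events([0, 5, 10, 150]))  # -> [0.0, 50.0, 60, 150.0]
```

The third event would ideally land at 100 ms, but the 50 ms cap stops it at 60 ms, so a tight burst only gets partly untangled. For live playback you'd have to delay everything by the window length first, since you can't move an event earlier than the moment it arrives.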