Rover,
If bass levels and dynamics are good at higher volume, and all is well at lower volumes, then indeed likely room acoustics, possible treatments, speaker placement are the things to look at. Switching gear alone may accomplish little or nothing until these fundamentals are addressed.
To what extent have you played with what you have to get things optimized? Definitely do that first. Nothing wrong with what you have I could determine as described to prevent that.
Is there echo in the room when you speak? That's a good test to determince how lively room might be and how sound echoes within. A lively loft with lots of echo might be a difficult beast to tame fully unfortunately, but I'd be willing to bet playing around with things a bit might help significantly.
I read your speakers are rear ported.
First thing I would try is move both speakers 2-3 feet out away from walls minimum, more if possible and play around with different locations and see what that does, especially if rear ported. Bass may be affected negatively or not, but other current reported problem areas like detail and imaging/soundstage should improve. Adding subs if needed at that point might help.
Use a good mono recording and make sure speakers are close enough to produce a solid center image from your prime listening position or preferred sweet spot., not too far apart as to leave sound muddled and spread out with no center focus. Then try some stereo recordings and see what happens then.
You can also try toeing speakers in or out as well along with placement to help tune things. Might require a lot of experimentation to determine what sounds best at your preferred listening position.
I would try a position with speakers well out from nearby walls so that distance sound reflected of any nearby rear or side wall at each primary reflection point travels from speaker to your ears at your primary listening position or preferred sweet spot is at least twice that of sound that reaches you directly from speakers. This will help address timing issues between direct and reflected sound that will muddy the detail if not within certain tolerances.
If within the tolerances, as determined by relative distances direct and reflected sound travels to your ears, 3-D soundstaging and imaging and associated detail will improve. Try that first and tweak accordingly from there to fine tune.