Maybe the "scientific test" does not have to include a perfect test environment. As long as a new speaker and a broken-in speaker are tested in the same environment, comparative test results can be shown. All we would like to know is whether there is a sound difference between the two... the trick would be having the testing plan together before you start. If you waste too much time on test methodology with the speakers, you will accidentally break in the new speaker before you really start tracking the results (yes, I am a dork and have a background in this sort of thing).
Even if you are a believer in break-in periods, it will take effort to damage the speakers; I doubt you could accidentally damage them just because they are not broken in. In my opinion, the break-in period has to do with sound quality. Since you have a month to audition your 532's, you might as well give them some serious listening and time to think about it. Then, regardless of whether you believe in break-in or not, you will have given the speakers a fair shot.