So I had some kind of stub value against which I was physically testing. Of course these kind of values get a different address when you cross boundaries and this had catastrophic effects on the computation in this case.
Conclusion: value your option types :–)
I now get my results within the same amount of time, just a bit disappointed by the high transfer time overhead.