The sun shield will indeed 'sail' solar radiation pressure (SRP) to a degree (that will vary widely according to spacecraft orientation which is of course dictated by science targets), but that has been factored in.
First of all, the L2 libration point 'orbit' (unstable by its very nature) is ever-so-slightly (sun-)biased to compensate for this additional force (trade it off against the gravitational forces). The trajectory (because it's not really an orbit as such) has to be corrected every ~21 days with a station-keeping burn anyway to compensate for the instability ie N-body gravitational forces (same SCAT thrusters as are being used for these mid-course corrections en route to L2), otherwise the observatory would either 'decay' back to to a geocentric orbit (interaction with the Moon making it somewhat more complex than a highly eccentric Earth-centred ellipse) or alternatively be perturbed out into a heliocentric orbit.
Secondly, SRP will also create a torque about the centre of mass (the sun shield won't always present full plane perpendicular on to the Sun radius vector). The reaction wheels will compensate for this, though in the process they accumulate angular momentum themselves, the rate depending on orientation of the spacecraft with respect to the Sun and duration spent in that attitude. So periodically they will have to be despun (there is naturally a limit on how much angular momentum they can accumulate - they can only spin so fast). To minimise this, astronomical observation sequences will be planned in such a way that the spacecraft will change observing direction to alternately load and then unload reaction wheels as much as is possible.
Obviously there is a balancing act between this engineering need and the astronomical science programme. The reaction wheels can only be unloaded in the aforementioned manner to some degree. So periodically the observatory also uses the thrusters to permit reaction wheels to be fully despun.
Additionally the spacecraft features a momentum trim tab (or flap) that helps balance out SRP (working in tandem with the reaction wheels). This 'paddle' uses the SRP to help reduce reaction wheel angular momentum build-up and fuel burn that would otherwise be spent on maintaining attitude (countering the torque).
View attachment 303458