Some digging through Python posts solved my problem--apparently the latest re version (sre, actually) has Unicode support--but it has stack limitations that don't allow you to send large chunks of text through it. The solution, if my question raised your own questions, is to import pre as re in your external method.