encode unicode for lxml