Hello Ronald,
I'm using Tidy to convert the HTML to XHTML, then I feed the result to the Microsoft XML parser and apply XSL templates to it. It works fine for the most part, apart from the occasional problem when somebody pastes something from Word into the editor.
I've set this up in both directions. Loading: the XML document comes from a database, and XSL is applied to turn it into HTML. Saving: the HTML from the editor is fed to Tidy, after which it goes to the XML parser, which converts it back to the format that I want.
It took me quite a bit of time to get this going and it is not fail-safe, but it works. You will have to create a couple of safety nets to keep the user away from parsing errors and the like.
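To give an idea of what such a safety net could look like, here is a minimal sketch in Python (not what I actually run; my setup uses the Microsoft XML parser). It assumes Tidy has already produced the XHTML string, and the names check_well_formed and choose_content_to_store are made up for this example. The idea is simply to test that the markup parses as XML before it goes any further, and to fall back to the last good version if it does not.

```python
import xml.etree.ElementTree as ET


def check_well_formed(xhtml):
    """Return (True, None) if the string parses as XML, else (False, message)."""
    try:
        ET.fromstring(xhtml)
    except ET.ParseError as err:
        # The message includes the line and column of the problem,
        # which can be logged instead of being shown to the user.
        return False, str(err)
    return True, None


def choose_content_to_store(tidied_xhtml, last_good_xhtml):
    """Hypothetical helper: keep the previous version if the new one is broken."""
    ok, _error = check_well_formed(tidied_xhtml)
    return tidied_xhtml if ok else last_good_xhtml


if __name__ == "__main__":
    good = "<div><p>Hello</p></div>"
    bad = "<div><p><b>Pasted from Word</p></div>"  # <b> is never closed
    print(check_well_formed(good))
    print(check_well_formed(bad)[0])
    print(choose_content_to_store(bad, good) == good)
```

A check like this only catches markup that is not well-formed; it says nothing about whether the XSL templates will produce the output you expect.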
If you want more specific details or help with this, please contact me directly at bart.roozendaal@sevensteps.nl
Bye,
Bart