Using regex to fix xml content -
I use regular expressions to match some bilingual XML files with known patterns of incorrect expressions and replace true values I am trying to do Most of the problems in XML files can consider typo or redundant data.
I have a text processing tool that works in software without any regex support, but the whole situation would be so simple if I could just use something similar to a batch job using SAL or script Could leave and leave overnight. For example, a script that should resolve the problem might look like the following:
#! / Bin / sed -fs / & lt; Prop type = "Att :: status" gtc: mediawiki-xid = "att :: status" & gt; not valid / gs / lift; prop type = "et :: position" & gt; approved / Amp; Lift; prop type = "Att :: Status" & Gt; Valid I have found that sed UTF does not quite like 16 files, and Since we are working with bilingual XML in 34 different language combinations, it could be a very dangerous tool to use to wrap around ISKV like sed scripts The non-standard charset conversion tool causes any kind of corruption and I want to decide the rest of the week in which languages the script works properly.
It is also worth mentioning that XML is full of deposits In the last few years a client's translation, so there are many poor syntax which can support some tools.
In short, sed + iconv is very risky, I basic basic lessons Of Interior equipment, I have Notepad ++, I have a list of expression to convert CAD syntax I but no easier / better way?
See this is a command line tool set which is to read / manipulate XML.
Specifically, the command might specify what you want to change, what you want to change, and how to change it, its XPaths. It will respect the specified XML character encoding, which will not be your standard command line tool.
Comments
Post a Comment