Java – how to split large XML into small pieces using vtdgenhuge?
I want to break the large XML into small pieces I am using vtdgen to split the XML file into small pieces, which is suitable for file sizes < 2 GB VTD - XML uses in - memory to parse XML. I don't want to load XML into memory So I tried to map memory using vtdgenhuge The code applies to vtdgen, but it doesn't work when I use vtdgenhuge
String prefix = "<?xml version=\"1.0\" encoding=\"UTF-8\"?>\n"+"\n"; String suffix = "\n</Employees>\n"; try { VTDGenHuge vg = new VTDGenHuge(); if (vg.parseFile("C:\\Users\\abc\\Desktop\\latestxml\\Input_1.xml",true,VTDGenHuge.MEM_MAPPED)) { int splitBy = ; System.out.println("Started time"+ new Date()); VTDNavHuge vn = vg.getNav(); AutoPilotHuge ap = new AutoPilotHuge(vn); ap.selectXPath("/Employees/Employee"); FastLongBuffer flb = new FastLongBuffer(4); int i; byte[] xml = vn.getXML().getBytes(); while ((i = ap.evalXPath()) != -1) { flb.append(vn.getElementFragment()); } int size = flb.size(); if (size != 0) { File fo = null; FileOutputStream fos = null; for (int k = 0; k < size; k++) { if (k % splitBy == 0) { if (fo != null) { fos.write(suffix.getBytes()); fos.close(); fo = null; } } if (fo == null) { fo = new File("C:\\Users\\abc\\Desktop\\Test\\xml\\"+"out" + k + ".xml"); fos = new FileOutputStream(fo); fos.write(prefix.getBytes()); } fos.write("\n".getBytes()); fos.write(xml,flb.lower32At(k),flb.upper32At(k)); } if (fo != null) { fos.write(suffix.getBytes()); fos.close(); fo = null; } } } } catch (Exception e) { e.printStackTrace(); }
I get the null value as "byte [] XML = VN. Getxml(). Getbytes();" When you do syso VN When getxml (), you get the value of the object But "getbytes()" returned null I don't know why? But if you do "byteat (x)" x = any long value, it returns a value
My XML file is:
<?xml version="1.0" encoding="UTF-8"?> <Employees> <Employee id="1"> <age>29</age> <name>Pankaj</name> <gender>Male</gender> <role>Java Developer</role> </Employee> <Employee id="2"> <age>35</age> <name>Lisa</name> <gender>Female</gender> <role>CEO</role> </Employee> <Employee id="3"> <age>40</age> <name>Tom</name> <gender>Male</gender> <role>Manager</role> </Employee> <Employee id="1"> <age>29</age> <name>Pankaj</name> <gender>Male</gender> <role>Java Developer</role> </Employee> <Employee id="2"> <age>35</age> <name>Lisa</name> <gender>Female</gender> <role>CEO</role> </Employee> <Employee id="3"> <age>40</age> <name>Tom</name> <gender>Male</gender> <role>Manager</role> </Employee> <Employees>
I want to come out like this
<?xml version="1.0" encoding="UTF-8"?> <Employees> <Employee id="1"> <age>29</age> <name>Pankaj</name> <gender>Male</gender> <role>Java Developer</role> </Employee> <Employee id="2"> <age>35</age> <name>Lisa</name> <gender>Female</gender> <role>CEO</role> </Employee> <Employee id="3"> <age>40</age> <name>Tom</name> <gender>Male</gender> <role>Manager</role> </Employee> <Employees> <?xml version="1.0" encoding="UTF-8"?> <Employees> <Employee id="1"> <age>29</age> <name>Pankaj</name> <gender>Male</gender> <role>Java Developer</role> </Employee> <Employee id="2"> <age>35</age> <name>Lisa</name> <gender>Female</gender> <role>CEO</role> </Employee> <Employee id="3"> <age>40</age> <name>Tom</name> <gender>Male</gender> <role>Manager</role> </Employee> <Employees>
resolvent
Solution
I think the VN. XML extension of VTD XML Getxml () returns an ibytebuffer interface object different from the standard VTD XML You can call the inteface method named writeoutputto file () and pass the offset and value parameters to it Sorry, the document part is missing, but this is a basic low-level... @ h_ 301_ 4@ @H_ 301_ 4@