Skip to content
This repository has been archived by the owner on Nov 2, 2020. It is now read-only.

Indexing fails for some pages #2

Open
kahlep opened this issue Mar 29, 2017 · 0 comments
Open

Indexing fails for some pages #2

kahlep opened this issue Mar 29, 2017 · 0 comments

Comments

@kahlep
Copy link

kahlep commented Mar 29, 2017

Some issues to sort out:

03/29 14:00:38.026 INFO | [e.t.s.u.IndexDocumentThread, Thread-2998] Indexing doc ID = 8446
03/29 14:00:38.040 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 1 of doc = 8446
03/29 14:00:38.077 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 8446 page 1
03/29 14:00:38.078 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 2 of doc = 8446
03/29 14:00:38.203 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 8446 page 2
03/29 14:00:38.203 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 3 of doc = 8446
03/29 14:00:38.241 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 8446 page 3
03/29 14:00:38.241 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 4 of doc = 8446
03/29 14:00:38.275 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 8446 page 4
03/29 14:00:38.275 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 5 of doc = 8446
03/29 14:00:38.314 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 8446 page 5
03/29 14:00:38.314 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 6 of doc = 8446
03/29 14:00:38.362 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 8446 page 6
03/29 14:00:38.363 INFO | [e.t.s.u.IndexDocumentThread, Thread-2998] Indexing doc ID = 14368
03/29 14:00:38.377 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 9 of doc = 14368
03/29 14:00:38.435 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 14368 page 9
03/29 14:00:38.435 INFO | [e.t.s.TrpIndexer, Thread-2998] Commiting...
03/29 14:00:38.538 INFO | [e.t.s.u.IndexDocumentThread, Thread-2998] Indexing 14 pages.
03/29 14:00:38.539 INFO | [e.t.s.u.IndexDocumentThread, Thread-2998] Indexing doc ID = 1376
03/29 14:00:38.554 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 486 of doc = 1376
03/29 14:00:38.631 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 1376 page 486
03/29 14:00:38.632 ERROR | [e.t.p.l.SolrManager, Thread-2998] Page 486 of doc 1376 could not be indexed!
java.lang.IllegalArgumentException: Comparison method violates its general contract!
at java.util.TimSort.mergeLo(TimSort.java:777) ~[na:1.8.0_102]
at java.util.TimSort.mergeAt(TimSort.java:514) ~[na:1.8.0_102]
at java.util.TimSort.mergeCollapse(TimSort.java:441) ~[na:1.8.0_102]
at java.util.TimSort.sort(TimSort.java:245) ~[na:1.8.0_102]
at java.util.Arrays.sort(Arrays.java:1512) ~[na:1.8.0_102]
at java.util.ArrayList.sort(ArrayList.java:1454) ~[na:1.8.0_102]
at java.util.Collections.sort(Collections.java:175) ~[na:1.8.0_102]
at eu.transkribus.core.model.beans.pagecontent_trp.TrpRegionType.sortRegions(TrpRegionType.java:154) ~[TranskribusCore-0.0.2.jar:na]
at eu.transkribus.core.model.beans.pagecontent_trp.TrpPageType.sortRegions(TrpPageType.java:517) ~[TranskribusCore-0.0.2.jar:na]
at eu.transkribus.core.model.beans.pagecontent_trp.TrpPageType.sortContent(TrpPageType.java:491) ~[TranskribusCore-0.0.2.jar:na]
at eu.transkribus.core.model.builder.TrpPageUnmarshalListener.setParent(TrpPageUnmarshalListener.java:106) ~[TranskribusCore-0.0.2.jar:na]
at eu.transkribus.core.model.builder.TrpPageUnmarshalListener.afterUnmarshal(TrpPageUnmarshalListener.java:53) ~[TranskribusCore-0.0.2.jar:na]
at com.sun.xml.internal.bind.v2.runtime.unmarshaller.Loader.fireAfterUnmarshal(Loader.java:221) ~[na:1.8.0_102]
at com.sun.xml.internal.bind.v2.runtime.unmarshaller.StructureLoader.leaveElement(StructureLoader.java:265) ~[na:1.8.0_102]
at com.sun.xml.internal.bind.v2.runtime.unmarshaller.UnmarshallingContext.endElement(UnmarshallingContext.java:585) ~[na:1.8.0_102]
at com.sun.xml.internal.bind.v2.runtime.unmarshaller.SAXConnector.endElement(SAXConnector.java:165) ~[na:1.8.0_102]
at org.apache.xerces.parsers.AbstractSAXParser.endElement(Unknown Source) ~[xercesImpl-2.8.1.jar:na]
at org.apache.xerces.impl.XMLNSDocumentScannerImpl.scanEndElement(Unknown Source) ~[xercesImpl-2.8.1.jar:na]
at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown Source) ~[xercesImpl-2.8.1.jar:na]
at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source) ~[xercesImpl-2.8.1.jar:na]
at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source) ~[xercesImpl-2.8.1.jar:na]
at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source) ~[xercesImpl-2.8.1.jar:na]
at org.apache.xerces.parsers.XMLParser.parse(Unknown Source) ~[xercesImpl-2.8.1.jar:na]
at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source) ~[xercesImpl-2.8.1.jar:na]
at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source) ~[xercesImpl-2.8.1.jar:na]
at com.sun.xml.internal.bind.v2.runtime.unmarshaller.UnmarshallerImpl.unmarshal0(UnmarshallerImpl.java:243) ~[na:1.8.0_102]
at com.sun.xml.internal.bind.v2.runtime.unmarshaller.UnmarshallerImpl.unmarshal(UnmarshallerImpl.java:214) ~[na:1.8.0_102]
at javax.xml.bind.helpers.AbstractUnmarshallerImpl.unmarshal(AbstractUnmarshallerImpl.java:157) ~[na:1.8.0_102]
at javax.xml.bind.helpers.AbstractUnmarshallerImpl.unmarshal(AbstractUnmarshallerImpl.java:162) ~[na:1.8.0_102]
at javax.xml.bind.helpers.AbstractUnmarshallerImpl.unmarshal(AbstractUnmarshallerImpl.java:171) ~[na:1.8.0_102]
at eu.transkribus.core.util.PageXmlUtils.unmarshal(PageXmlUtils.java:186) ~[TranskribusCore-0.0.2.jar:na]
at eu.transkribus.solrSearch.TrpIndexer.createIndexDocument(TrpIndexer.java:487) ~[TranskribusSearch-0.0.1.jar:na]
at eu.transkribus.solrSearch.TrpIndexer.updatePageIndex(TrpIndexer.java:248) ~[TranskribusSearch-0.0.1.jar:na]
at eu.transkribus.persistence.logic.SolrManager.indexPage(SolrManager.java:85) ~[TranskribusPersistence-1.0.jar:na]
at eu.transkribus.persistence.logic.SolrManager.indexPages(SolrManager.java:115) ~[TranskribusPersistence-1.0.jar:na]
at eu.transkribus.server.util.IndexDocumentThread.run(IndexDocumentThread.java:55) [IndexDocumentThread.class:na]
03/29 14:00:38.634 INFO | [e.t.s.u.IndexDocumentThread, Thread-2998] Indexing doc ID = 3630
03/29 14:00:38.648 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 161 of doc = 3630
03/29 14:00:38.666 ip=172.27.23.213 gui=NA DEBUG | [e.t.s.r.JobMgmt, ajp-bio-8009-exec-28753] got pending jobs: 10
03/29 14:00:38.686 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 3630 page 161
03/29 14:00:38.687 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 80 of doc = 3630
03/29 14:00:38.723 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 3630 page 80
03/29 14:00:38.724 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 81 of doc = 3630
03/29 14:00:38.759 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 3630 page 81
03/29 14:00:38.760 INFO | [e.t.s.u.IndexDocumentThread, Thread-2998] Indexing doc ID = 3918
03/29 14:00:38.774 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 1 of doc = 3918
03/29 14:00:38.815 ERROR | [e.t.p.l.SolrManager, Thread-2998] Page 1 of doc 3918 could not be indexed!
java.lang.NullPointerException: null
03/29 14:00:38.816 INFO | [e.t.s.u.IndexDocumentThread, Thread-2998] Indexing doc ID = 3919
03/29 14:00:38.830 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 1 of doc = 3919
03/29 14:00:38.888 ERROR | [e.t.p.l.SolrManager, Thread-2998] Page 1 of doc 3919 could not be indexed!
java.lang.NullPointerException: null
03/29 14:00:38.889 INFO | [e.t.s.u.IndexDocumentThread, Thread-2998] Indexing doc ID = 7699
03/29 14:00:38.903 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 2 of doc = 7699
03/29 14:00:39.039 ERROR | [e.t.p.l.SolrManager, Thread-2998] Page 2 of doc 7699 could not be indexed!
java.lang.NumberFormatException: For input string: ""
at java.lang.NumberFormatException.forInputString(NumberFormatException.java:65) ~[na:1.8.0_102]
at java.lang.Integer.parseInt(Integer.java:592) ~[na:1.8.0_102]
at java.lang.Integer.parseInt(Integer.java:615) ~[na:1.8.0_102]
at eu.transkribus.solrSearch.util.IndexTextUtils.generateBaseline(IndexTextUtils.java:159) ~[TranskribusSearch-0.0.1.jar:na]
at eu.transkribus.solrSearch.util.IndexTextUtils.getWordsFromLine(IndexTextUtils.java:43) ~[TranskribusSearch-0.0.1.jar:na]
at eu.transkribus.solrSearch.TrpIndexer.getWordList(TrpIndexer.java:569) ~[TranskribusSearch-0.0.1.jar:na]
at eu.transkribus.solrSearch.TrpIndexer.createIndexDocument(TrpIndexer.java:520) ~[TranskribusSearch-0.0.1.jar:na]
at eu.transkribus.solrSearch.TrpIndexer.updatePageIndex(TrpIndexer.java:248) ~[TranskribusSearch-0.0.1.jar:na]
at eu.transkribus.persistence.logic.SolrManager.indexPage(SolrManager.java:85) ~[TranskribusPersistence-1.0.jar:na]
at eu.transkribus.persistence.logic.SolrManager.indexPages(SolrManager.java:115) ~[TranskribusPersistence-1.0.jar:na]
at eu.transkribus.server.util.IndexDocumentThread.run(IndexDocumentThread.java:55) [IndexDocumentThread.class:na]
03/29 14:00:39.044 INFO | [e.t.s.u.IndexDocumentThread, Thread-2998] Indexing doc ID = 8446
03/29 14:00:39.058 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 1 of doc = 8446
03/29 14:00:39.704 ip=172.27.23.213 gui=NA DEBUG | [e.t.s.r.JobMgmt, ajp-bio-8009-exec-28607] got pending jobs: 10
03/29 14:00:39.928 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 8446 page 1
03/29 14:00:39.928 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 2 of doc = 8446
03/29 14:00:39.969 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 8446 page 2
03/29 14:00:39.969 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 3 of doc = 8446
03/29 14:00:40.004 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 8446 page 3
03/29 14:00:40.005 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 4 of doc = 8446
03/29 14:00:40.041 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 8446 page 4
03/29 14:00:40.041 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 5 of doc = 8446
03/29 14:00:40.077 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 8446 page 5
03/29 14:00:40.077 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 6 of doc = 8446
03/29 14:00:40.112 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 8446 page 6
03/29 14:00:40.113 INFO | [e.t.s.u.IndexDocumentThread, Thread-2998] Indexing doc ID = 14368
03/29 14:00:40.127 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 9 of doc = 14368
03/29 14:00:40.164 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 14368 page 9
03/29 14:00:40.164 INFO | [e.t.s.TrpIndexer, Thread-2998] Commiting...
03/29 14:00:40.267 INFO | [e.t.s.u.IndexDocumentThread, Thread-2998] Indexing 14 pages.
03/29 14:00:40.269 INFO | [e.t.s.u.IndexDocumentThread, Thread-2998] Indexing doc ID = 1376
03/29 14:00:40.283 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 486 of doc = 1376
03/29 14:00:40.361 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 1376 page 486
03/29 14:00:40.361 ERROR | [e.t.p.l.SolrManager, Thread-2998] Page 486 of doc 1376 could not be indexed!
java.lang.IllegalArgumentException: Comparison method violates its general contract!
at java.util.TimSort.mergeLo(TimSort.java:777) ~[na:1.8.0_102]
at java.util.TimSort.mergeAt(TimSort.java:514) ~[na:1.8.0_102]
at java.util.TimSort.mergeCollapse(TimSort.java:441) ~[na:1.8.0_102]
at java.util.TimSort.sort(TimSort.java:245) ~[na:1.8.0_102]
at java.util.Arrays.sort(Arrays.java:1512) ~[na:1.8.0_102]
at java.util.ArrayList.sort(ArrayList.java:1454) ~[na:1.8.0_102]
at java.util.Collections.sort(Collections.java:175) ~[na:1.8.0_102]
at eu.transkribus.core.model.beans.pagecontent_trp.TrpRegionType.sortRegions(TrpRegionType.java:154) ~[TranskribusCore-0.0.2.jar:na]
at eu.transkribus.core.model.beans.pagecontent_trp.TrpPageType.sortRegions(TrpPageType.java:517) ~[TranskribusCore-0.0.2.jar:na]
at eu.transkribus.core.model.beans.pagecontent_trp.TrpPageType.sortContent(TrpPageType.java:491) ~[TranskribusCore-0.0.2.jar:na]
at eu.transkribus.core.model.builder.TrpPageUnmarshalListener.setParent(TrpPageUnmarshalListener.java:106) ~[TranskribusCore-0.0.2.jar:na]
at eu.transkribus.core.model.builder.TrpPageUnmarshalListener.afterUnmarshal(TrpPageUnmarshalListener.java:53) ~[TranskribusCore-0.0.2.jar:na]
at com.sun.xml.internal.bind.v2.runtime.unmarshaller.Loader.fireAfterUnmarshal(Loader.java:221) ~[na:1.8.0_102]
at com.sun.xml.internal.bind.v2.runtime.unmarshaller.StructureLoader.leaveElement(StructureLoader.java:265) ~[na:1.8.0_102]
at com.sun.xml.internal.bind.v2.runtime.unmarshaller.UnmarshallingContext.endElement(UnmarshallingContext.java:585) ~[na:1.8.0_102]
at com.sun.xml.internal.bind.v2.runtime.unmarshaller.SAXConnector.endElement(SAXConnector.java:165) ~[na:1.8.0_102]
at org.apache.xerces.parsers.AbstractSAXParser.endElement(Unknown Source) ~[xercesImpl-2.8.1.jar:na]
at org.apache.xerces.impl.XMLNSDocumentScannerImpl.scanEndElement(Unknown Source) ~[xercesImpl-2.8.1.jar:na]
at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown Source) ~[xercesImpl-2.8.1.jar:na]
at org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown Source) ~[xercesImpl-2.8.1.jar:na]
at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source) ~[xercesImpl-2.8.1.jar:na]
at org.apache.xerces.parsers.XML11Configuration.parse(Unknown Source) ~[xercesImpl-2.8.1.jar:na]
at org.apache.xerces.parsers.XMLParser.parse(Unknown Source) ~[xercesImpl-2.8.1.jar:na]
at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source) ~[xercesImpl-2.8.1.jar:na]
at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown Source) ~[xercesImpl-2.8.1.jar:na]
at com.sun.xml.internal.bind.v2.runtime.unmarshaller.UnmarshallerImpl.unmarshal0(UnmarshallerImpl.java:243) ~[na:1.8.0_102]
at com.sun.xml.internal.bind.v2.runtime.unmarshaller.UnmarshallerImpl.unmarshal(UnmarshallerImpl.java:214) ~[na:1.8.0_102]
at javax.xml.bind.helpers.AbstractUnmarshallerImpl.unmarshal(AbstractUnmarshallerImpl.java:157) ~[na:1.8.0_102]
at javax.xml.bind.helpers.AbstractUnmarshallerImpl.unmarshal(AbstractUnmarshallerImpl.java:162) ~[na:1.8.0_102]
at javax.xml.bind.helpers.AbstractUnmarshallerImpl.unmarshal(AbstractUnmarshallerImpl.java:171) ~[na:1.8.0_102]
at eu.transkribus.core.util.PageXmlUtils.unmarshal(PageXmlUtils.java:186) ~[TranskribusCore-0.0.2.jar:na]
at eu.transkribus.solrSearch.TrpIndexer.createIndexDocument(TrpIndexer.java:487) ~[TranskribusSearch-0.0.1.jar:na]
at eu.transkribus.solrSearch.TrpIndexer.updatePageIndex(TrpIndexer.java:248) ~[TranskribusSearch-0.0.1.jar:na]
at eu.transkribus.persistence.logic.SolrManager.indexPage(SolrManager.java:85) ~[TranskribusPersistence-1.0.jar:na]
at eu.transkribus.persistence.logic.SolrManager.indexPages(SolrManager.java:115) ~[TranskribusPersistence-1.0.jar:na]
at eu.transkribus.server.util.IndexDocumentThread.run(IndexDocumentThread.java:55) [IndexDocumentThread.class:na]
03/29 14:00:40.364 INFO | [e.t.s.u.IndexDocumentThread, Thread-2998] Indexing doc ID = 3630
03/29 14:00:40.378 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 161 of doc = 3630
03/29 14:00:40.414 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 3630 page 161
03/29 14:00:40.414 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 80 of doc = 3630
03/29 14:00:40.454 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 3630 page 80
03/29 14:00:40.454 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 81 of doc = 3630
03/29 14:00:40.491 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 3630 page 81
03/29 14:00:40.493 INFO | [e.t.s.u.IndexDocumentThread, Thread-2998] Indexing doc ID = 3918
03/29 14:00:40.507 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 1 of doc = 3918
03/29 14:00:40.549 ERROR | [e.t.p.l.SolrManager, Thread-2998] Page 1 of doc 3918 could not be indexed!
java.lang.NullPointerException: null
03/29 14:00:40.550 INFO | [e.t.s.u.IndexDocumentThread, Thread-2998] Indexing doc ID = 3919
03/29 14:00:40.564 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 1 of doc = 3919
03/29 14:00:40.612 ERROR | [e.t.p.l.SolrManager, Thread-2998] Page 1 of doc 3919 could not be indexed!
java.lang.NullPointerException: null
03/29 14:00:40.613 INFO | [e.t.s.u.IndexDocumentThread, Thread-2998] Indexing doc ID = 7699
03/29 14:00:40.628 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 2 of doc = 7699
03/29 14:00:40.740 ip=172.27.23.213 gui=NA DEBUG | [e.t.s.r.JobMgmt, ajp-bio-8009-exec-28607] got pending jobs: 10
03/29 14:00:40.759 ERROR | [e.t.p.l.SolrManager, Thread-2998] Page 2 of doc 7699 could not be indexed!
java.lang.NumberFormatException: For input string: ""
at java.lang.NumberFormatException.forInputString(NumberFormatException.java:65) ~[na:1.8.0_102]
at java.lang.Integer.parseInt(Integer.java:592) ~[na:1.8.0_102]
at java.lang.Integer.parseInt(Integer.java:615) ~[na:1.8.0_102]
at eu.transkribus.solrSearch.util.IndexTextUtils.generateBaseline(IndexTextUtils.java:159) ~[TranskribusSearch-0.0.1.jar:na]
at eu.transkribus.solrSearch.util.IndexTextUtils.getWordsFromLine(IndexTextUtils.java:43) ~[TranskribusSearch-0.0.1.jar:na]
at eu.transkribus.solrSearch.TrpIndexer.getWordList(TrpIndexer.java:569) ~[TranskribusSearch-0.0.1.jar:na]
at eu.transkribus.solrSearch.TrpIndexer.createIndexDocument(TrpIndexer.java:520) ~[TranskribusSearch-0.0.1.jar:na]
at eu.transkribus.solrSearch.TrpIndexer.updatePageIndex(TrpIndexer.java:248) ~[TranskribusSearch-0.0.1.jar:na]
at eu.transkribus.persistence.logic.SolrManager.indexPage(SolrManager.java:85) ~[TranskribusPersistence-1.0.jar:na]
at eu.transkribus.persistence.logic.SolrManager.indexPages(SolrManager.java:115) ~[TranskribusPersistence-1.0.jar:na]
at eu.transkribus.server.util.IndexDocumentThread.run(IndexDocumentThread.java:55) [IndexDocumentThread.class:na]
03/29 14:00:40.763 INFO | [e.t.s.u.IndexDocumentThread, Thread-2998] Indexing doc ID = 8446
03/29 14:00:40.778 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 1 of doc = 8446
03/29 14:00:40.817 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 8446 page 1
03/29 14:00:40.817 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 2 of doc = 8446
03/29 14:00:40.853 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 8446 page 2
03/29 14:00:40.853 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 3 of doc = 8446
03/29 14:00:40.895 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 8446 page 3
03/29 14:00:40.895 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 4 of doc = 8446
03/29 14:00:40.933 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 8446 page 4
03/29 14:00:40.933 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 5 of doc = 8446
03/29 14:00:40.971 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 8446 page 5
03/29 14:00:40.971 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 6 of doc = 8446
03/29 14:00:41.008 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 8446 page 6
03/29 14:00:41.009 INFO | [e.t.s.u.IndexDocumentThread, Thread-2998] Indexing doc ID = 14368
03/29 14:00:41.025 DEBUG | [e.t.p.l.SolrManager, Thread-2998] Indexing page Nr. 9 of doc = 14368
03/29 14:00:41.063 ERROR | [e.t.s.TrpIndexer, Thread-2998] XML Unmarshal failed for Doc 14368 page 9

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant