I am using Terrier on Windows. It works fine for a small dataset, but when I try to index a large dataset it throws an OutOfMemoryError with the following output:
Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
at gnu.trove.TObjectIntHashMap.rehash(TObjectIntHashMap.java:170)
at gnu.trove.THash.postInsertHook(THash.java:359)
at gnu.trove.TObjectIntHashMap.put(TObjectIntHashMap.java:155)
at org.terrier.utility.TermCodes.getCode(TermCodes.java:99)
at org.terrier.structures.indexing.DocumentPostingList.getTermId(DocumentPostingList.java:133)
at org.terrier.structures.indexing.DocumentPostingList$2.execute(DocumentPostingList.java:168)
at org.terrier.structures.indexing.DocumentPostingList$2.execute(DocumentPostingList.java:165)
at gnu.trove.TObjectIntHashMap.forEachEntry(TObjectIntHashMap.java:426)
at org.terrier.structures.indexing.DocumentPostingList.getPostings2(DocumentPostingList.java:165)
at org.terrier.indexing.BasicIndexer.indexDocument(BasicIndexer.java:379)
at org.terrier.indexing.BasicIndexer.createDirectIndex(BasicIndexer.java:270)
at org.terrier.indexing.Indexer.index(Indexer.java:346)
at org.terrier.applications.TRECIndexing.index(TRECIndexing.java:122)
Can anyone guide me with this? I tried changing some parameters in anyclass.bat as instructed on the Terrier forum, but it didn't help.
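For reference, one way to check whether a heap setting actually reaches the JVM is to print the limit from a tiny Java program started with the same options. This is only a minimal sketch: the class name HeapCheck is hypothetical and not part of Terrier, and it uses nothing beyond the standard java.lang.Runtime calls.

```java
public class HeapCheck {
    // Convert a byte count to whole mebibytes (hypothetical helper).
    static long toMiB(long bytes) {
        return bytes / (1024L * 1024L);
    }

    public static void main(String[] args) {
        Runtime rt = Runtime.getRuntime();
        // maxMemory() is the upper bound the JVM will try to use for the heap.
        System.out.println("Max heap (MiB):       " + toMiB(rt.maxMemory()));
        // totalMemory() is what the JVM has currently reserved.
        System.out.println("Allocated heap (MiB): " + toMiB(rt.totalMemory()));
        // freeMemory() is the unused part of the currently reserved heap.
        System.out.println("Free heap (MiB):      " + toMiB(rt.freeMemory()));
    }
}
```

Running it as, for example, `java -Xmx2g HeapCheck` (the 2g value is just an illustration) should report a maximum close to the requested size. If the number does not change when the setting is edited, the option is probably not being passed to the java command at all.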
Terrier Java heap space out of memory error