You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
On line 102 of commoncrawl/src/main/java/org/commoncrawl/util/shared/ARCFileReader.java, there's a comment that says the constructor is private (it's actually public), and refers to the "factory method above" even though it's the first method in the file.
/**
* constructor is now private. use the factory method above to construct a reader
* @param source
* @throws IOException
*/
public ARCFileReader(final InputStream source)throws IOException {
super(new CustomPushbackInputStream(new CountingInputStream(source),
_blockSize), new Inflater(true), _blockSize);
readARCHeader();
}
The text was updated successfully, but these errors were encountered:
Thanks, I'll have a look at it. But just for clarification: this library is used to access the Common Crawl data from 2012 or earlier. Recent data uses a different format (WARC instead of ARC).
On line 102 of commoncrawl/src/main/java/org/commoncrawl/util/shared/ARCFileReader.java, there's a comment that says the constructor is private (it's actually public), and refers to the "factory method above" even though it's the first method in the file.
The text was updated successfully, but these errors were encountered: