| Commit message (Collapse) | Author | Age |
| |
|
|
|
|
|
|
|
|
|
|
|
| |
This entails:
* removal of updateindex scheduling-related properties and code
* turn CreateIndexJson into a regular Java class, which doesn't extend
CollectorMain anymore.
* call index processing functionality after each CollecTorMain run
* add a simple lock mechanism (to CreateIndexJson) in order to prevent
concurrent writes to any of the index.json* files
* adapt all tests accordingly
Implements task-20251.
|
| | |
|
| | |
|
| |
|
|
| |
Implements task-20287.
|
| | |
|
| | |
|
| | |
|
| |
|
|
| |
Found while looking into #25522.
|
| |
|
|
| |
Making the test from the previous commit pass and fixing task-25522.
|
| |
|
|
| |
Making a static method easier accessible for tests.
|
| |
|
|
|
|
|
|
|
| |
We did download bastet's votes in the past after reading the
consensus. But there could have been situations when there was no
consensus that we did not explicitly ask for bastet's vote.
Found per chance while setting up a CollecTor instance with a webstats
module.
|
| |
|
|
| |
Part of task-20234.
|
| | |
|
| | |
|
| |
|
|
|
|
|
|
| |
Clean log lines immediately when they are read and also make use of sanitized
log's high redundancy immediately, i.e., continue with maps of
<LocalDate, <Map<String, Long>>.
Rename method(s) to reflect what they do.
|
| |
|
|
|
|
|
|
|
|
|
|
|
| |
Adapt to latest changes of metrics-lib (task-25329) and make use of the high
redundancy of logs (e.g. a 3G file might only contain 350 different lines).
This avoids OOM and array out of bounds exceptions for large files (>2G) and
gives a speed-up of roughly 50%. (The earlier 66min are down to 34min for
meronense&weschniakowii files plus two larger files.)
There is a BATCH constant, which could be tuned for processing speed. It is
logged for each webstats module run. Currently, it is set to 100k. This
was more or less arbitrarily chosen and used for all the tests. A test run
using 500k didn't show significant differences.
|
| | |
|
| | |
|
| |
|
|
| |
Also edit here and there for more readability and less lines.
|
| |
|
|
| |
Add convenience constructor accepting the first string as argument.
|
| |
|
|
|
|
| |
To avoid possible inconsistencies DescriptorBuilder is finalized after the first
call to 'toString' and cannot be altered anymore. Any attempt to add more leads
to an IllegalStateException.
|
| | |
|
| |
|
|
| |
This might facilitate easier processing of descriptors.
|
| |
|
|
|
|
|
|
|
|
|
| |
The class doesn't 'know' about descriptor sanitization, it is only a sort of
container for writing descriptors. It could be actually moved to some util
package and used in other parsing steps, too.
Also rename test helper classes to avoid naming conflicts.
Remove 'descriptor' from variable names.
Make DescriptorBuilder public.
Adapt other classes as well as tests.
|
| |
|
|
| |
First step towards implementing #20549.
|
| | |
|
| | |
|
| | |
|
| | |
|
| |
|
|
| |
Implements task-22428.
|
| | |
|
| | |
|
| |
|
|
| |
Fixes #24621.
|
| | |
|
| | |
|
| | |
|
| | |
|
| | |
|
| |
|
|
|
|
|
|
|
| |
Typically, the "published" line appears before the "fingerprint" line.
However, an alternative Tor implementation orders these two lines
differently, which is valid due to the spec. We need to handle this
case by accepting lines in either order.
Fixes #23981.
|
| | |
|
| | |
|
| | |
|
| |
|
|
| |
Implements the final part of task-21414 for CollecTor.
|
| | |
|
| | |
|
| | |
|
| | |
|
| |
|
|
|
|
|
|
| |
This includes adding property 'OnionPerfSources' and renaming
some markers properly. In addition, all camel-case occurrences
of 'OnionPerf' have a capitalized 'P' now.
Part of task-21759.
|
| | |
|
| |
|
|
|
| |
With #22836 being deployed, Tor Metrics parses our index.json and
provides its own directory listings. Time to stop prettifying ours.
|