31
votes
Accepted
Benchmarking, why discard lowest time?
The lowest timing might indeed represent the "true" timing without outside interference, or it might be a measurement error. E.g. the boosting behaviour of a CPU might speed up the first run ...
28
votes
Benchmarking, why discard lowest time?
Outliers indicate unusual situations. Outliers are interesting in science, because they give you something to investigate, but they are useless in benchmarks.
If you have 10000 benchmark runs, and ...
8
votes
Benchmarking, why discard lowest time?
I'd add the thought that before doing anything else I would eyeball the data, that is, plot its distribution. I'd do that with most datasets, for that matter. My experience as a retired statto is ...
6
votes
Alternative to Wilson Score when I only have the number of ratings and the average rating?
You can use Wilson scoring; however, the main issue with your approach is that you are discretising the data, which results in a loss of information and is best avoided wherever possible. A better ...
5
votes
Accepted
How to get a useful measure for latency
You will probably want to make several measurements here, because you'll want to understand how the system works both including and excluding the outliers. There are many possible solutions for ...
4
votes
Birthday Paradox, Analytical and Monte Carlo solutions give two systemically slightly different results
Consider when your number of people, n, is 366.
Using your proposed analytic solution for n=366,
you get NumPairs = n*(n-1)/2 = 66,795.
You then say that the probability of two people having
...
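A minimal sketch of why n = 366 is a useful test case, assuming 365 equally likely birthdays and assuming the analytic solution treats every pair as an independent trial (the excerpt is truncated, so that reading is a guess). The function names are hypothetical. With 366 people a shared birthday is certain, so the exact probability of no match is 0, while the pairwise approximation still leaves a tiny non-zero chance.

```python
DAYS = 365  # assumption: non-leap year, uniformly distributed birthdays


def num_pairs(n):
    """Number of distinct pairs among n people: n*(n-1)/2."""
    return n * (n - 1) // 2


def exact_no_match(n, days=DAYS):
    """Exact probability that all n birthdays are distinct."""
    p = 1.0
    for k in range(n):
        p *= (days - k) / days  # factor becomes 0 once k == days
    return p


def pairwise_no_match(n, days=DAYS):
    """Approximation that treats every pair as an independent trial."""
    return (1.0 - 1.0 / days) ** num_pairs(n)


if __name__ == "__main__":
    for n in (23, 366):
        print(n, num_pairs(n), exact_no_match(n), pairwise_no_match(n))
```

The two functions agree closely for small n and drift apart as n grows, which is one way a systematic gap between an analytic and a Monte Carlo result can arise.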
3
votes
Accepted
Design a function that indicates significant deviations in response times
I work in a completely different domain, but we have a similar requirement where our system must take action when a measured physical signal is outside a predefined band long enough to be considered ...
3
votes
(AI) algorithm to optimize input parameters
I would recommend trying different standard algorithms for optimizing non-differentiable functions and seeing how well each works. Off the top of my head, in increasing complexity:
hill climbing
threshold ...
3
votes
Accepted
Compare two arrays by the number of occurances
You could try an ad-hoc method such as summing the weight of all tags, but that is not a meaningful metric.
A better approach would be to perform statistical inference to answer a question like “what ...
2
votes
Weighted correlation coefficient
Because you are only comparing the candidates to the mean sample of each class, you lose information about the distribution of each class. You are trying to compensate for this by assigning a ...
2
votes
(AI) algorithm to optimize input parameters
SMAC, Sequential Model-based Algorithm Configuration, is a relatively recent approach to this problem. It may well be overkill. It is aimed at the scenario where evaluating a particular configuration ...
2
votes
Accepted
Ranking results from a Question and Answer game
I agree with mmathis that this might be a better question for Math/Stats or even GameDev SE. However, here's a suggestion:
Points per question
Answering questions gets you points and you get more ...
1
vote
Accepted
Optimal variable-time logging of a real-time data stream
Decide how much memory your filter is allowed to consume. This is what you have to work with when deciding if something is interesting enough to send to the file. In here you can hold more than you ...
1
vote
Design a function that indicates significant deviations in response times
Have a look at HdrHistogram.
There are implementations for all kinds of languages.
What it effectively is, is a history of latency distributions. So you could have a latency distribution per second ...
1
vote
Benchmarking, why discard lowest time?
Another point: benchmarks are generally "averaged" using the geometric mean. The geometric mean intrinsically upweights the lowest value in the list, when compared to the arithmetic mean.
On ...
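A small illustration of that point, using made-up timings (the values and variable names are assumptions, not data from the answer). Halving the fastest run moves the geometric mean by exactly the same factor as halving any other run, whereas the arithmetic mean barely notices a change to the smallest value.

```python
import statistics

# Hypothetical benchmark timings in seconds: one fast run, three slow ones.
times = [1.0, 10.0, 10.0, 10.0]

arith = statistics.fmean(times)
geo = statistics.geometric_mean(times)  # Python 3.8+

# Halve the lowest value, then halve one of the larger values instead.
low_halved = [0.5, 10.0, 10.0, 10.0]
high_halved = [1.0, 5.0, 10.0, 10.0]

geo_ratio_low = statistics.geometric_mean(low_halved) / geo
geo_ratio_high = statistics.geometric_mean(high_halved) / geo
arith_ratio_low = statistics.fmean(low_halved) / arith
arith_ratio_high = statistics.fmean(high_halved) / arith

if __name__ == "__main__":
    print("arithmetic mean:", arith, "geometric mean:", geo)
    print("geometric ratios:", geo_ratio_low, geo_ratio_high)
    print("arithmetic ratios:", arith_ratio_low, arith_ratio_high)
```

The geometric mean responds to relative changes, so the lowest timing has as much pull as the highest.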
1
vote
How to get a useful measure for latency
By themselves the pings are unlikely to show much of anything interesting or useful. Neither are basic stats like median and mean for those. Percentiles are typically more interesting for this kind ...
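As a rough sketch of why percentiles can say more than the mean, the snippet below summarises a hypothetical list of ping times with the standard-library `statistics.quantiles`. The sample values and the helper name `latency_percentiles` are assumptions for illustration only.

```python
import statistics


def latency_percentiles(samples_ms):
    """Return the 50th, 95th and 99th percentiles of a list of latencies."""
    # n=100 yields 99 cut points; cuts[k - 1] is the k-th percentile.
    cuts = statistics.quantiles(samples_ms, n=100, method="inclusive")
    return {"p50": cuts[49], "p95": cuts[94], "p99": cuts[98]}


# Hypothetical data: mostly 20 ms pings with a few slow outliers.
pings_ms = [20.0] * 95 + [25.0, 30.0, 40.0, 250.0, 900.0]

summary = latency_percentiles(pings_ms)
mean_ms = statistics.fmean(pings_ms)

if __name__ == "__main__":
    print("mean:", mean_ms)
    print(summary)
```

Here the mean is pulled well above the typical ping by two slow samples, while the median stays at the typical value and the high percentiles show the tail directly.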
1
vote
FIFO Min-Max-Heap for Rolling Median
This is a refinement of Jerry Coffin's idea.
Use a nearly balanced tree, where all nodes reside directly in the circular buffer.
Initialize it with dummy values, so that the size stays constant all ...
1
vote
FIFO Min-Max-Heap for Rolling Median
If I were going to do this, I'd probably use a balanced tree (e.g., AVL or red-black) where each node also keeps track of the size of its left sub-tree (and you keep track of the overall size). This ...
Only top scored, non community-wiki answers of a minimum length are eligible
Related Tags
statistics × 61
algorithms × 14
python × 5
performance × 5
design × 4
language-agnostic × 4
math × 4
machine-learning × 4
c++ × 3
artificial-intelligence × 3
random × 3
metrics × 3
google × 3
benchmarking × 3
java × 2
php × 2
web-development × 2
testing × 2
data × 2
embedded-systems × 2
real-time × 2
reporting × 2
research × 2
data-mining × 2
c# × 1