Roll20Aggregator


Help

How do I get the chat log for my game?

Open the Chat Archive of your Roll20 game. This can be found on the game page in the Content dropdown menu.
Click "Show on One Page".
Open your browser's developer tools. This can usually be done with F12 or Ctrl+Shift+I on Windows, Command+Option+I on Mac, or through the browser menu.

In the developer tools, navigate to the Elements or Inspector tab (browser-dependent), where you should see something like the following:

<html>
    <head>...</head>
    <body class="css" style="height: 100%;">
    <h1 class="pull-left" style="margin: 0px 0px 6px 6px;">Chat Archive</h1>
    <a href="..." style="display:inline-block;padding:10px;">Return to Details Page</a>
    <div class="clear"></div>
    <div style="margin:10px;">...</div>
    <div>...</div>
    <div class="clear"></div>
    <div class="textchatcontainer" id="textchat">...</div>
    ...

Right-click on the following line and select Copy. Depending on the browser, this may appear as Copy element or Copy inner HTML.
```
<div class="textchatcontainer" id="textchat">...</div>
```
Paste the contents into the text editor of your choice (e.g. Notepad) and save as a .txt file.
Upload this file to the Roll20Aggregator. Enjoy!

Do not attempt to save the chat log as a .html file directly from the browser via the Save command. This encodes the HTML in a way that cannot be parsed by the aggregator.

If you're unable to parse your chat log, please send an email to roll20aggregator@outlook.com. Please attach your chat log file for testing.

About

Roll20 Aggregator is a web app that parses the chat log of a Roll20 game to display aggregate statistics and answer questions such as: Who wrote the most messages? Who rolled the most 1s? Who was the luckiest? Were the virtual dice fair?

More fully, the following features are available upon uploading a chat log:

Pie chart and tabular representations of message and dice data
Ranking of highest to lowest rollers, expressed as Z score across all dice
Breakdown of roll results for individual die types in raw count or percent format, sortable by character or result column
Chi square analysis to determine fairness of dice
Creation of character groups to be considered jointly as single characters
Roll log - useful for validating the parser's output against the chat log
Filter displayed data by selected characters/groups

Roll20 Aggregator was built using the following technologies:

.NET 8
Blazor
Bootstrap 5.3.0
Font Awesome 6.6.0
AngleSharp 1.1.2: used to parse HTML
Radzen.Blazor 5.1.3: used to display pie charts
Accord.Statistics 3.8.0: used for chi square test and pooled standard deviation calculations

This site does not store any user data.

This site is not affiliated with Roll20.

How Parsing Works

Parsing Rolls

The parser works by checking the HTML of each .message div and parsing it for certain classes and attributes that identify rolls.

There are two chief classes to look out for: .rollresult, which represents a roll block (the more graphical result you get when you type /r d20); and .inlinerollresult, which represents an inline roll (the more compact, textual result you get when you type [[d20]]).

In a roll block, the relevant roll information can be obtained from the .dicegrouping div. In the below example, we can see from the .diceroll div that a d6 was rolled, and from the .didroll div that a 3 was the raw roll. Importantly, this is the value that was rolled, before any modifiers were applied.

<div data-origindex="0" class="diceroll d6">
    <div class="dicon">
        <div class="didroll">3 </div>
        <div class="backing"> </div>
    </div>
</div>
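
As a rough illustration of the roll block extraction described above, here is a minimal Python sketch using only the standard library. The aggregator itself is built on .NET and AngleSharp, so this is not its actual code; the class name RollBlockParser, the helper parse_roll_block and the assumption that the die type always appears as a class of the form d followed by digits are all hypothetical.

```python
import re
from html.parser import HTMLParser

SAMPLE = """
<div data-origindex="0" class="diceroll d6">
    <div class="dicon">
        <div class="didroll">3 </div>
        <div class="backing"> </div>
    </div>
</div>
"""

class RollBlockParser(HTMLParser):
    """Collects (die_type, raw_result) pairs from .diceroll/.didroll divs."""

    def __init__(self):
        super().__init__()
        self.rolls = []
        self._die_type = None      # die type of the enclosing .diceroll div
        self._in_didroll = False   # True while inside a .didroll div

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if "diceroll" in classes:
            # Assumption: the die type is a sibling class such as "d6".
            self._die_type = next(
                (c for c in classes if re.fullmatch(r"d\d+", c)), None
            )
        elif "didroll" in classes:
            self._in_didroll = True

    def handle_data(self, data):
        text = data.strip()
        if self._in_didroll and text.isdigit():
            self.rolls.append((self._die_type, int(text)))

    def handle_endtag(self, tag):
        # Any closing tag ends the (leaf) .didroll div.
        self._in_didroll = False

def parse_roll_block(html):
    parser = RollBlockParser()
    parser.feed(html)
    return parser.rolls

print(parse_roll_block(SAMPLE))
```

The result pairs the die type with the raw value only, which mirrors the point above that modifiers are ignored.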

Things are a little messier for inline rolls. In the below example, we can see that all the roll information is contained in the original-title attribute of the .inlinerollresult span. The die type can be parsed from the 1d100 string, and the raw result can be found within the .basicdiceroll span. Note again that in this case we are interested in the 49 that was rolled, not the resulting 40 from a -9 being applied.

<span class="inlinerollresult showtip tipsy-n-right"
        original-title="Rolling 1d100 - 9 = (<span class="basicdiceroll">49</span>)-9">
    40
</span>
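
The inline case can be sketched in a similar spirit. The following Python snippet is only an illustration, not the aggregator's implementation: the function name parse_inline_roll and both regular expressions are assumptions, it expects the original-title text to have already been read from the attribute and unescaped, and it only looks at the first die expression in the title.

```python
import re

# Matches a die expression such as "1d100" or "d20".
DIE_RE = re.compile(r"\b(\d*)d(\d+)\b")
# Matches the raw value inside a .basicdiceroll span.
RAW_RE = re.compile(r'class="basicdiceroll[^"]*">\s*(\d+)\s*</span>')

def parse_inline_roll(title):
    """Return (die_type, raw_result) pairs found in an original-title string."""
    die_match = DIE_RE.search(title)
    if die_match is None:
        return []
    die_type = "d" + die_match.group(2)
    return [(die_type, int(raw)) for raw in RAW_RE.findall(title)]

title = 'Rolling 1d100 - 9 = (<span class="basicdiceroll">49</span>)-9'
print(parse_inline_roll(title))
```

Note that the modified total (40) is never read; only the value inside the .basicdiceroll span is kept.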

Note that messages may contain multiple rolls, and in rare cases the two types of rolls may be combined: a roll block containing an inline roll.

Using the above methods, we can associate a die roll with the author of the message, which can typically be found in the .by span. This span will not be present in consecutive messages by the same user, so if not present, the message is associated with the previous author that was identified.
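
The carry-forward rule for authors can be sketched as follows. This is a simplified Python illustration under assumed data shapes: each message is represented as a hypothetical (by, rolls) pair, where by is None when the .by span is absent, and the function name assign_authors is not taken from the aggregator.

```python
def assign_authors(messages):
    """Attach an author to every message, reusing the last seen author
    when a message has no .by span (represented here as None)."""
    current_author = None
    attributed = []
    for by, rolls in messages:
        if by is not None:
            current_author = by
        attributed.append((current_author, rolls))
    return attributed

log = [("Tim", [3]), (None, [5]), ("Ana", [1])]
print(assign_authors(log))
```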

Emote Messages

Further complications to identifying authorship arise through emote messages - that is, messages of the following format:

August the Second shoots a fireball, dealing [[1d20]] damage.

These messages do not have any HTML information to uniquely identify the author, so we must use the text content itself, which begins with the character name. However, while as a human we can read the above and understand that the character is named August the Second, the parser has no immediate way to know which words should be included in the character name.

The parser attempts to get around this using the following strategy:

Save parsing of emote messages until after all other messages have been parsed.
When parsing regular messages in which character names can be identified, map avatar URLs (found in .avatar div) to character names. Note that to our advantage, users will by default likely have different avatars.
After the first round of parsing has been completed, parse emote messages separately and use avatar data to aid in identifying the character.

In the ideal case, a character is uniquely identified.
If multiple characters share an avatar, search the emote message for the longest character name match.
If no avatar mapping was found, search the emote message for the longest match among all known character names.
If no match is found, it likely means that the character only ever typed emote messages. This is rare, and the parser must fall back to assuming that the first word of the emote message is the character name.
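
The fallback chain above might look roughly like the following Python sketch. It is an illustration only: the function name resolve_emote_author, the avatar_to_names mapping and the choice to match names at the start of the message text are assumptions, and the case where several characters share an avatar but none of them matches simply falls through to the first-word rule here.

```python
def resolve_emote_author(text, avatar_url, avatar_to_names, known_names):
    """Guess which character wrote an emote message."""
    candidates = avatar_to_names.get(avatar_url, set())
    if len(candidates) == 1:
        # Ideal case: the avatar uniquely identifies the character.
        return next(iter(candidates))
    # Shared avatar: search those names. No mapping: search all known names.
    pool = candidates if candidates else known_names
    matches = [name for name in pool if text.startswith(name)]
    if matches:
        return max(matches, key=len)  # longest name wins
    # Last resort: assume the first word is the character name.
    words = text.split()
    return words[0] if words else None

avatars = {"a.png": {"August", "August the Second"}, "b.png": {"Tim"}}
names = {"August", "August the Second", "Tim"}
print(resolve_emote_author(
    "August the Second shoots a fireball", "a.png", avatars, names))
```

Preferring the longest match is what keeps "August the Second" from being shortened to "August" when both names are known.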

How results are analyzed

How overall luck is calculated

The fundamental concept used across dice analysis is the average result of the die. This is equal to (S + 1) / 2, where S is the number of sides. On a six-sided die, or d6, for instance, the average roll would be (6 + 1) / 2 = 3.5. A 3.5 can never actually be rolled in a single roll of the d6, of course, but over many rolls, the average roll will approach this value if the d6 is truly random.

The above formula can be extended to rolling multiple dice. For example, if you roll two six-sided dice, 2d6, the average result would be 2(3.5) = 7. More generally, the average result for a dice roll can be said to be N(S + 1) / 2, where N is the number of dice and S is the number of sides of the die.

Using only these concepts, we can sum up all of the dice rolled across a campaign by a hypothetical character named Tim and calculate the average result of all these dice to determine whether Tim rolled above or below average.
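
As a small worked example of the comparison described above, the following Python sketch sums a handful of made-up rolls for Tim and compares them with the expected total. The roll data and the function names are purely illustrative.

```python
def expected_roll(sides):
    """Average result of a single fair die: (S + 1) / 2."""
    return (sides + 1) / 2

def observed_and_expected(rolls):
    """rolls: list of (sides, raw_result) pairs, one per die rolled."""
    observed = sum(result for _, result in rolls)
    expected = sum(expected_roll(sides) for sides, _ in rolls)
    return observed, expected

# Hypothetical rolls for Tim: two d6s and one d20.
tim = [(6, 4), (6, 2), (20, 17)]
observed, expected = observed_and_expected(tim)
print(observed, expected)  # 23 17.5
```

Here Tim rolled a total of 23 against an expected 17.5, so he rolled above average.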

When it comes to comparing the rolls of two different characters, though, we need a way to express how far from the average the result is, in a way that doesn't depend on the number of dice rolled or what types of dice they were. Otherwise, it wouldn't make sense: a character rolling only d100s will of course have a higher average than a character rolling only d6s.

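This can be done using the Z score statistic. It expresses the distance of a result from the mean in units of standard deviation, a measure of how spread out data is around its mean. A Z score of 0 means the result was exactly average, a positive Z score means it was above average, and a negative Z score means it was below average.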

Calculating standard deviation for individual dice rolls and combining them into a single pooled standard deviation is beyond the scope of this explanation. For more reading on this topic, check out this article on dice variance by Analytics Check and this article on pooled standard deviation by Statistics How To.

Since the Z scores are normalized, they can be compared between characters. A character with a higher Z score will have on average rolled higher, and a character with a lower Z score will have on average rolled lower.
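
To make the idea concrete, here is a simplified Python sketch of a Z score across mixed dice. It is not the aggregator's calculation, which relies on a pooled standard deviation from Accord.Statistics. Instead, this sketch assumes independent fair dice, uses the known variance of a single fair die with S sides, (S^2 - 1) / 12, and adds the variances together; the function names are hypothetical.

```python
import math

def die_variance(sides):
    """Variance of one fair die with the given number of sides."""
    return (sides ** 2 - 1) / 12

def z_score(rolls):
    """rolls: list of (sides, raw_result) pairs.
    Returns how many standard deviations the observed total lies
    from the expected total, assuming independent fair dice."""
    observed = sum(result for _, result in rolls)
    expected = sum((sides + 1) / 2 for sides, _ in rolls)
    std_dev = math.sqrt(sum(die_variance(sides) for sides, _ in rolls))
    return (observed - expected) / std_dev

print(round(z_score([(6, 4), (6, 2), (20, 17)]), 3))
```

Because the result is expressed in standard deviations, a character who rolled only d100s and one who rolled only d6s end up on the same scale.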

How dice fairness is calculated

Calculating the fairness of a die may require more of a statistics lecture than you might want to read. In summary, the goal is to determine how probable the rolls you observed would be if the die were truly random. For example, if you roll a six-sided die 600 times, you would expect each face to be rolled roughly 100 times. If instead you roll a one 300 times, you might suspect that the die isn't fair.

This idea of comparing observed and expected frequency is precisely what can be calculated with a chi square test. The higher the resulting chi square statistic of this test, the lower the chance that you would see your results with a truly random die.

Specifically, the test produces a p value that represents how likely your results are given the dice being random. A p value of 0.10, for instance, would mean that there's a 10% chance of rolling results at least as far from the expected distribution as yours if the dice are fair.

There is no set threshold at which one can objectively say a die is unfair. In statistics, we more or less arbitrarily choose what is known as the significance level, a p value threshold at or below which the results are said to be significant. One commonly used significance level is 0.05, and this is what is used by the aggregator.

That is to say, if the calculated p value of your rolls is 0.05 or lower, we reject the assumption that your dice are truly random. If the p value is greater than 0.05, we cannot however conclude that the dice are random. We can make no conclusions - we have simply not found a significant effect.
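
The test can be sketched in plain Python as follows. This is an illustration rather than the aggregator's code, which uses Accord.Statistics. The function names are assumptions, every face is assumed to be equally likely, and the p value is computed with the closed-form series for a chi square distribution with a whole number of degrees of freedom (number of faces minus one).

```python
import math

def chi_square_statistic(observed):
    """observed: count of rolls per face, assuming all faces equally likely."""
    expected = sum(observed) / len(observed)
    return sum((o - expected) ** 2 / expected for o in observed)

def chi_square_p_value(stat, df):
    """Upper-tail probability for a chi square statistic with integer df."""
    if stat <= 0:
        return 1.0
    if df % 2 == 0:
        # Even df: exp(-x/2) * sum of (x/2)^i / i! for i < df/2.
        half = stat / 2
        term = total = 1.0
        for i in range(1, df // 2):
            term *= half / i
            total += term
        return math.exp(-half) * total
    # Odd df: normal tail term plus a finite series in chi = sqrt(x).
    chi = math.sqrt(stat)
    term, total = chi, 0.0
    for r in range(1, (df - 1) // 2 + 1):
        if r > 1:
            term *= stat / (2 * r - 1)
        total += term
    tail = math.erfc(chi / math.sqrt(2))
    return tail + math.sqrt(2 / math.pi) * math.exp(-stat / 2) * total

# The d6 example above: 600 rolls, 300 of which came up as a one.
counts = [300, 60, 60, 60, 60, 60]
stat = chi_square_statistic(counts)
p = chi_square_p_value(stat, df=len(counts) - 1)
print(stat, p <= 0.05)  # 480.0 True
```

With a statistic of 480 the p value is far below 0.05, so the assumption of a fair die would be rejected for this hypothetical data.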

For more reading on this topic, check out this post by Ilmari Karonen.

Limitations

Although care was taken in allowing the aggregator to parse as many rolls as possible as accurately as possible, there are known limitations to the parser and this site as a whole:

Private messages and description messages are not parsed.
In rolls where multiple dice are rolled but only a certain number are kept, such as rolling with advantage in D&D 5e, all rolls are parsed, including the dice that were dropped.
Special rolls such as FATE dice or compounding dice may not be parsed correctly.
Individual die statistics can only be analyzed for standard die types: d2, d4, d6, d8, d10, d12, d20, d100.
In tabletop games, dice results may be good or bad in different contexts. Sometimes higher is better, sometimes higher is worse. The "Highest Roller" ranking may therefore not represent actual "luck"; it only pertains to raw values and makes no presumptions about whether the outcome is good or bad.
The site is intended for use on large, horizontal screens. It will not display correctly on mobile devices.

Because of the complexity of parsing, the aggregator cannot guarantee 100% coverage and accuracy.