Merge eugeneia/snabb:timeline-raptorjit into Vita #65

eugeneia · 2018-12-07T15:05:51Z

This builds on @lukego’s work on the timeline log, a probabilistic flight recorder for Snabb, see: snabbco#849 snabbco#873 snabbco#916 snabbco#973 snabbco#1011 snabbco#1098 snabbco#1112

My current working branch for the timeline is at eugeneia/snabb:timeline-raptorjit, this branch merges this feature into Vita and adds some application specific events (user events).

See #58 for some example plots.

This is a very useful instruction for self-benchmarking programs that want to read the CPU timestamp counter efficiently. See Intel whitepaper for details: http://www.intel.com/content/dam/www/public/us/en/documents/white-papers/ia-32-ia-64-benchmark-code-execution-paper.pdf

Use 'double' instead of 'uint64_t' for values in the timeline file. This change is motivated by making timeline files easier to process by R. In the future we may switch back to uint64_t for the TSC counter and/or argument values for improved precision. The major_version file header field can be used to avoid confusion. The obvious downside to using doubles is that the TSC value will lose precision as the server uptime increases (the TSC starts at zero and increases at the base frequency of the CPU e.g. 2GHz.) The impact seems to be modest though. For example a 2GHz CPU would start rounding TSC values to the nearest 128 (likely quite acceptable in practice) after approximately 2 years of operation (2^53 * 128 cycles.) So - storing the TSC as a double-float is definitely a kludge - but unlikely to cause any real harm and expedient for the short-term goal of putting this code to use without getting blocked due to e.g. my lack of sophisticated as an R hacker.

Resolved conflict in app.lua between adding timeline events and the new breath topological-sort machinery.

…x-next-opt

Simplify the code and eliminate unwanted branches from the engine loop by drawing a random timeline level from a log-uniform distribution that mathematically favors higher log levels over lower ones. Plucked log5() out of the air i.e. each log level should be enabled for 5x more breaths than the one below. Here is how the distribution of log level choice looks in practice using this algorithm: > t = {0,0,0,0,0,0,0,0,0} > for i = 1, 1e8 do local n = math.max(1,math.ceil(math.log(math.random(5^9))/math.log(5))) t[n] = t[n]+1 end > for i,n in ipairs(t) do print(i,n) end 1 560 2 2151 3 10886 4 55149 5 273376 6 1367410 7 6844261 8 34228143 9 171120244 Note: Lua provides only natural logarithm functions but it is easy to derive other bases from this (google "log change of base formula").

I suspect that it is a misfeature for the timeline to sample the contents of packets. Do we really want user data potentially appearing in debug logs? Removed for now.

Cleanup timeline integration in core.app a little along the merge.

…aptorjit

…les" This reverts commit 0e52b24.

This fixes a bug where timeline log level was rerolled between end of breaths but before before post-breath events, causing sampling to affect the event lag of the polled_timers event.

Changes the syntax of event specs to <level>,<rate>|<eventname>: ... The previous level digit becomes the event’s "rate" and retains its semantics with regard to the logging frequency of the specified event. The "stack depth" of the event is now decoupled as the new, leading level digit and specified independently. The new level semantics are as follows: - level ranges from 0-9 (10 levels in total) - 0 is the top most level while 9 in the lowest - levels 0-4 are reserved for use by the engine - user applications can use levels 5-9 to create hierarchy in their events Caveat: users should avoid defining events with a higher level and a lower event rate than an enclosed event if the higher level event is supposed to serve as a latency anchor for the lower level event. RIGHT WRONG 5,3|op_start: 5,2|op_start: 6,2|op_iter: 6,3|op_iter: 5,3|op_end: 5,2|op_end: In the left most WRONG example, the anchor of the op_inter event depends on the log rate at the time of sampling.

randomize_log_rate will then set the rate to values between 1 and 6, enabling the respective events.

Realized: 9 is the rate that means "log always". Elevate some rare but important events to rate 9.

…main

lukego and others added 30 commits November 8, 2016 09:34

Merge v2016.11 branch 'snabbco/master' into next

773cdcd

core.timeline: New module for "timeline" logs

95872b1

Makefile: Embed "*.events" timeline specs

930f294

engine: Add engine.events, app.events, link.events

aa9980a

core.link: Add dropped bytes counter

8530942

engine: Instrument with timeline events

9b252f0

Merge branch 'next' into timeline-redux-next

945efe1

Merge snabbco/next into timeline-redux-next

276e3c5

Resolved conflict in app.lua between adding timeline events and the new breath topological-sort machinery.

engine: update timeline level probabilities

563ae7a

Merge remote-tracking branch 'wingo/topo-link-sort' into timeine-redu…

3bb71fa

…x-next-opt

Merge v2017.04 branch 'master' into timeline-redux

4f94b55

engine: Remove timeline packet payload sampling

16a6bb2

I suspect that it is a misfeature for the timeline to sample the contents of packets. Do we really want user data potentially appearing in debug logs? Removed for now.

Merge tag 'v2018.04' into timeline

596d6b5

Merge remote-tracking branch 'snabbco/raptorjit' into timeline-raptorjit

9c87769

Merge remote-tracking branch 'snabbco/raptorjit' into timeline-raptorjit

2604715

Cleanup timeline integration in core.app a little along the merge.

Merge remote-tracking branch 'snabbco/master' into timeline-raptorjit

615a8c8

core.app: remove remains from 16a6bb2 (timeline packet sampling)

1bb4108

core.timeline: make timeline log available to other core modules

0e52b24

core.packet: record packet allocation events to timeline

e834845

Merge remote-tracking branch 'origin/mp-packet-reuse' into timeline-r…

07b00f2

…aptorjit

Revert "core.timeline: make timeline log available to other core modu…

a6def7b

…les" This reverts commit 0e52b24.

lib.ptree.worker: reformulate to use engine.main with done func

8201d1d

engine.main: fix memory leak due to latency histogram creation

debd26c

Merge branch 'timeline-raptorjit' into vita-timeline2

aa2b0ed

core.app: set timeline log level at the very end of breathe loop

edaad6c

This fixes a bug where timeline log level was rerolled between end of breaths but before before post-breath events, causing sampling to affect the event lag of the polled_timers event.

core.packet: reorder group_fl released/reclaimed/unlocked events

9c4c804

eugeneia added 14 commits November 8, 2018 12:10

Merge branch 'timeline-raptorjit' into vita-timeline2

32c3751

lib.ipsec.esp: added low-level timeline events

4166c0e

vita: added low level timeline events for packet dispatch and routing

198db25

lib.ptree.worker: fixup to 8201d1d

217760b

core.app: initialize timeline log rate to 7 (no logging)

8e68109

randomize_log_rate will then set the rate to values between 1 and 6, enabling the respective events.

Merge branch 'timeline-raptorjit' into vita-timeline2

9c2faf3

core.timeline: add switch to disable timeline logging

8f0c6bf

Merge branch 'timeline-raptorjit' into vita-timeline2

ce780a9

core.app: add timeline events for app network configuration

30cafdc

Merge branch 'timeline-raptorjit' into vita-timeline2

1a6aad4

Fix some wrong assumptions about the event rate

e5faba0

Realized: 9 is the rate that means "log always". Elevate some rare but important events to rate 9.

Merge branch 'timeline-raptorjit' into vita-timeline2

8a32233

ptree.worker: emit engine_stopped/engine_started events when pausing …

5b06215

…main

Merge branch 'vita' into vita-timeline2

c7a2106

eugeneia added a commit that referenced this pull request Mar 8, 2019

Merge PR #65 (Timeline) into vita-next

a72c364

eugeneia mentioned this pull request Mar 8, 2019

Merge vita-next #88

Merged

eugeneia merged commit c7a2106 into inters:master Mar 8, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge eugeneia/snabb:timeline-raptorjit into Vita #65

Merge eugeneia/snabb:timeline-raptorjit into Vita #65

eugeneia commented Dec 7, 2018

Merge eugeneia/snabb:timeline-raptorjit into Vita #65

Merge eugeneia/snabb:timeline-raptorjit into Vita #65

Conversation

eugeneia commented Dec 7, 2018