| Author | Commit | Message | Date |
|---|---|---|---|
| hank | c157f38957 | gpu: Add closure for Jetson and improve compatibility | 2025-01-24 22:07:37 -05:00 |
| Links | d185dfdef8 | get Jetson GPU Information | 2025-01-24 19:17:33 -05:00 |
| Henry Dollman | 1ac165d7d3 | include stats in error log when encoding stats fails | 2025-01-05 17:58:38 -05:00 |
| Henry Dollman | 8e531e6b3c | fix: handle duplicate GPU names (#361) | 2025-01-05 16:40:22 -05:00 |
| Henry Dollman | b08219dacf | refactor agent gpu code to make it easier to add intel / jetson | 2024-12-17 17:12:58 -05:00 |
| Henry Dollman | b4bc8a31aa | add check / reset for invalid disk i/o rates | 2024-11-24 15:56:12 -05:00 |
| Henry Dollman | 4cb7b97416 | change podman socket path to use current uid | 2024-11-12 18:14:43 -05:00 |
| Henry Dollman | b1db450e00 | enable gpu monitoring by default | 2024-11-12 18:13:57 -05:00 |
| Henry Dollman | 2e8ac98924 | Improve disk discovery slightly by checking partition labels | 2024-11-12 18:11:44 -05:00 |
| Henry Dollman | 3cd11d6bc4 | improve podman support (#211) | 2024-11-12 11:59:56 -05:00 |
| Henry Dollman | 03de73560c | add gpu power consumption chart | 2024-11-08 20:31:22 -05:00 |
| Henry Dollman | cd10727795 | gpu usage and vram charts | 2024-11-08 18:00:30 -05:00 |
| Henry Dollman | 8262a9a45b | progress on gpu metrics | 2024-11-08 16:52:50 -05:00 |
| Henry Dollman | 655bfc95ca | add ability to specify partition for extra disk using folder name | 2024-11-04 20:52:27 -05:00 |
| Henry Dollman | 741575df15 | revert tweaks for old docker. needs more testing. | 2024-11-02 14:43:35 -04:00 |
| Henry Dollman | df0f3a154f | rtl layout progress and updates to arabic translations | 2024-10-31 16:48:28 -04:00 |
| Henry Dollman | f8fc74116c | rm *sensors.Warnings conversion - gopsutil windows uses different type | 2024-10-26 14:02:19 -04:00 |
| Henry Dollman | 4094df3a61 | fix: skip temperature collection if SENSORS is empty string (#196) | 2024-10-24 15:10:20 -04:00 |
| Henry Dollman | 4a78ce1b16 | skip temperatures code if sensors whitelist is set to empty string | 2024-10-23 18:37:38 -04:00 |
| Henry Dollman | 539c0ccb1d | retry failed containers separately so we can run them in parallel (#58) | 2024-10-21 17:00:13 -04:00 |
| Henry Dollman | b5c158d1b3 | update debug logs | 2024-10-19 18:12:25 -04:00 |
| Henry Dollman | 8bf7a0e1d6 | add DOCKER_TIMEOUT env var | 2024-10-19 16:33:33 -04:00 |
| Henry Dollman | ee92e338cb | update debug log locations | 2024-10-16 18:12:43 -04:00 |
| Henry Dollman | 59d541dd1d | fix edge case overwriting extra filesystem with root io fallback | 2024-10-16 15:26:12 -04:00 |
| Henry Dollman | 6c31263e60 | add bandwidth alerts | 2024-10-12 17:22:25 -04:00 |
| Henry Dollman | 6cf6661f2e | raise docker client timeout to 8 seconds if version <= 24 | 2024-10-12 12:24:53 -04:00 |
| Henry Dollman | 5b0fac429b | move update functions to agent / hub packages | 2024-10-10 18:36:01 -04:00 |
| Henry Dollman | efca56ceca | add temp debug logs to troubleshoot #196 | 2024-10-10 18:28:24 -04:00 |
| Henry Dollman | 64f0a23969 | move fsStats creation to NewAgent function | 2024-10-10 18:18:57 -04:00 |
| Henry Dollman | 76cea9d3c3 | increase docker client timeout to 2100ms | 2024-10-08 19:17:03 -04:00 |
| Henry Dollman | 73aae62c2e | add ZFS ARC memory accounting | 2024-10-05 18:07:42 -04:00 |
| Henry Dollman | af4877ca30 | add MEM_CALC env var | 2024-10-05 15:29:27 -04:00 |
| Henry Dollman | c407fe9af0 | exclude sensor if temp <=0 \|\| temp >= 200 | 2024-10-05 11:14:20 -04:00 |
| Henry Dollman | 66cc0a4b24 | log stats on startup if log level is debug | 2024-10-02 19:58:02 -04:00 |
| Henry Dollman | f051f6a5f8 | add dockerManager / fix for Docker 24 and older (dockerManager now handles all docker api interaction and container metrics tracking; sets unlimited concurrency for docker 24 and older) | 2024-10-02 19:45:26 -04:00 |
| Henry Dollman | 45e1283b83 | move system.Info to Agent struct (cleaner to store entire info struct rather than separate properties for unchanging values) | 2024-10-02 12:34:42 -04:00 |
| Henry Dollman | 9ab359d3cf | add SENSORS env var | 2024-09-29 16:36:32 -04:00 |
| Henry Dollman | 268e364bd4 | update MemoryStats type | 2024-09-29 12:36:19 -04:00 |
| Henry Dollman | dd84a9fd35 | remove semaphore and limit docker host connections to 10 | 2024-09-29 12:30:30 -04:00 |
| Henry Dollman | 2f4e537f72 | change containerStatsMutex to a RWMutex | 2024-09-28 19:13:24 -04:00 |
| Henry Dollman | 9637363cf3 | combine container.Stats and container.PrevContainerStats | 2024-09-28 18:51:46 -04:00 |
| Henry Dollman | 73d0dd25ec | agent refactoring - create agent/docker.go, agent/system.go | 2024-09-28 17:49:04 -04:00 |
| Henry Dollman | 2ecf5572ba | remove addr, pubKey fields from agent struct | 2024-09-28 16:48:55 -04:00 |
| Henry Dollman | 5e97167ee0 | Fetch kernel, hostname, cpu at start rather than every run | 2024-09-28 16:38:52 -04:00 |
| Henry Dollman | 1a4862ecd9 | remove containerStats mutex and add stats by index | 2024-09-27 19:06:37 -04:00 |
| Henry Dollman | 4694642674 | add apiContainerList variable to reduce memory allocations | 2024-09-27 16:10:31 -04:00 |
| Henry Dollman | 56c0b86025 | rename variables for clarity | 2024-09-27 14:56:53 -04:00 |
| Henry Dollman | 82e3f3c7c1 | Fix temperature sensors not reporting if any sensor lacks valid data (#167) | 2024-09-27 13:10:13 -04:00 |
| Henry Dollman | cc32b50d82 | add agent.debug and comments | 2024-09-27 12:17:19 -04:00 |
| Henry Dollman | 764e043e83 | add slog and LOG_LEVEL to agent | 2024-09-26 20:07:35 -04:00 |