Pull requests: kubernetes-sigs/inference-perf

Pull requests list

[WIP] Add unit tests for the split config package cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.
#520 opened May 28, 2026 by Bslabe123 Contributor Draft
Refactor and improve max-model-len truncation to be more efficient approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
#512 opened May 28, 2026 by achandrasekar Contributor
Add K8s Slack invitation link cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/XS Denotes a PR that changes 0-9 lines, ignoring generated files.
#510 opened May 27, 2026 by tico88612 Member
Feat: Add filtering support for OTel trace replay across all data sources cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
#507 opened May 24, 2026 by lenadankin Contributor
Split config file approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. lgtm "Looks good to me", indicates that a PR is ready to be merged. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.
#505 opened May 22, 2026 by Bslabe123 Contributor
Inject session identity header for session replay requests cncf-cla: no Indicates the PR's author has not signed the CNCF CLA. size/M Denotes a PR that changes 30-99 lines, ignoring generated files.
#504 opened May 22, 2026 by pavanipenumalla
Update OWNERS_ALIASES approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/invalid-owners-file Indicates that a PR should not merge because it has an invalid OWNERS file in it. size/XS Denotes a PR that changes 0-9 lines, ignoring generated files.
#503 opened May 22, 2026 by achandrasekar Contributor
Emit Prometheus metrics for runtime observability cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
#501 opened May 21, 2026 by Bslabe123 Contributor
[WIP] Add config for constraining media pool cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.
#500 opened May 21, 2026 by Bslabe123 Contributor Draft
feat: add reasoning output support for OTEL trace replay cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
#499 opened May 21, 2026 by oritht Contributor
[conversation_replay] force min_tokens == max_tokens for deterministic output length cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/M Denotes a PR that changes 30-99 lines, ignoring generated files.
#497 opened May 20, 2026 by LoganVegnaSHOP Contributor
[WIP] Add support for ShareGpt4Video Dataset cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.
#494 opened May 18, 2026 by Bslabe123 Contributor Draft
regenerate system prompts per stage in conversation replay cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
#480 opened May 14, 2026 by zetxqx Contributor
[WIP] Add MMMU Dataset cncf-cla: no Indicates the PR's author has not signed the CNCF CLA. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.
#478 opened May 12, 2026 by Bslabe123 Contributor Draft
[WIP] Autogenerate config docs cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.
#474 opened May 12, 2026 by Bslabe123 Contributor Draft
Emit native llm-d-benchmark v0.2 partial reports alongside existing reports cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. do-not-merge/invalid-commit-message Indicates that a PR should not merge because it has an invalid commit message. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.
#461 opened Apr 29, 2026 by Bslabe123 Contributor
Security: Archive extraction vulnerable to path traversal (TarSlip) cncf-cla: no Indicates the PR's author has not signed the CNCF CLA. lgtm "Looks good to me", indicates that a PR is ready to be merged. size/S Denotes a PR that changes 10-29 lines, ignoring generated files.
#451 opened Apr 25, 2026 by tomaioo
[WIP] feat: Implement distributed Redis-based load generator approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.
#438 opened Apr 14, 2026 by jjk-g Collaborator
[WIP] Add Expressions API cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.
#423 opened Apr 9, 2026 by Bslabe123 Contributor Draft
[WIP] Add --url Flag and Config Autofilling Logic cncf-cla: no Indicates the PR's author has not signed the CNCF CLA. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files.
#384 opened Apr 1, 2026 by Bslabe123 Contributor Draft
[WIP] Cleanup Prometheus Metric Querying cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.
#382 opened Apr 1, 2026 by Bslabe123 Contributor Draft
Shared Prefix Trace Replay & Tree-of-Thought Generation cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. size/XL Denotes a PR that changes 500-999 lines, ignoring generated files.
#369 opened Mar 25, 2026 by diamondburned Contributor
Add wg-serving serving catalog approved Indicates a PR has been approved by an approver from all required OWNERS files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. lgtm "Looks good to me", indicates that a PR is ready to be merged. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.
#368 opened Mar 24, 2026 by jjk-g Collaborator
[WIP] Fix saturation detection and harden load generator cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files.
#360 opened Mar 2, 2026 by Bslabe123 Contributor Draft
fix: handle ShareGPT dataset exhaustion by reinitializing iterator cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/L Denotes a PR that changes 100-499 lines, ignoring generated files.
#359 opened Feb 27, 2026 by DebuggingMax