regenerate system prompts per stage in conversation replay#480
Conversation
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: zetxqx The full list of commands accepted by this bot can be found here. DetailsNeeds approval from an approver in each of these files:Approvers can indicate their approval by writing |
|
@alonh @LoganVegnaSHOP @Navjot10 Any thoughts on the behavior here? |
7ea8a09 to
eb304b5
Compare
9dadb00 to
44abf8f
Compare
|
This looks good overall — the cross-stage regeneration approach makes sense. The seed derivation in load_lazy_data uses f"{self.cr_config.seed}stage{data.stage_idx}" without any per-conversation component. When dynamic_system_prompt_len is configured, each conversation originally gets a unique suffix at init time. But on stage transitions, all conversations derive the same RNG from the same seed string. Wouldn't that cause all conversations to collapse to the same prompt, losing the per-conversation uniqueness? Nit: the new field is called stage_idx but the rest of the codebase uses stage_id — would be nice to keep consistent. |
Good point
|
3830dee to
f7ba153
Compare
Fixes: #479
This PR updates the ConversationReplayDataGenerator to automatically regenerate system prompts at the start of every new benchmark stage.