Skip to content

[Question]: Prompt templates for LLaVA-OneVision and Qwen3-72B instruction generation? #359

Description

@pqnhoang

Question

Hi, I'm reproducing the VLN-N1 instruction pipeline (keyframes → sub-clips → LLaVA-OneVision → Qwen3-72B rewrite/summarize) from your tech report. Could you share the exact prompts and #images per sub-clip for LLaVA and Qwen, plus how you define turn left/right in the prompt? I'm getting left/right mismatches vs. our trajectory. Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions