Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Question]: Fail to reproduce llmlingua on meetingbank #171

Open
jzhang538 opened this issue Jul 27, 2024 · 1 comment
Open

[Question]: Fail to reproduce llmlingua on meetingbank #171

jzhang538 opened this issue Jul 27, 2024 · 1 comment
Assignees
Labels
question Further information is requested

Comments

@jzhang538
Copy link

Describe the issue

Thanks for the interesting work. I tried to reproduce the results of llmlingua on the meetingbank QA dataset with Mistral-7B as the target LLM.

The small LLM I use is https://huggingface.co/NousResearch/Llama-2-7b-hf

However, the results seem much lower than the reported results in Table 4 of llmlingua2 (around 20 than 50.45 in the paper). Here is my implementation:

compressor = PromptCompressor(
model_name=args.model_name,
model_config={},
use_llmlingua2=False
)

iterative_size = 200
comp_dict = compressor.compress_prompt(
context=origin,
instruction="",
question="",
rate=args.compression_rate,
iterative_size=iterative_size,
context_budget="*2.0",
)

I'm wondering if there is any issue with my implementation?

@jzhang538 jzhang538 added the question Further information is requested label Jul 27, 2024
@pzs19
Copy link
Contributor

pzs19 commented Jul 30, 2024

Hi, @jzhang538, thank you for raising the question!

I think there are two reasons that may lead to this issue. The first is the parameters of LLMLingua, such as iterative_size or context_budget. The second is the evaluation. Note that we do not use the instruct version of Mistral in experiment, the model may generate lengthy responses and even raise similar questions in the response, which leads to a low performance. So it is necessary to truncate the responses at an appropriate place.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants