-
-
Notifications
You must be signed in to change notification settings - Fork 12
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
47 changed files
with
81 additions
and
128 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=0 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
|
||
|
||
source slurm/env.sh | ||
|
@@ -23,6 +22,6 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config full --model-class claude --large-model claude-3-5-sonnet-20241022 --small-model claude-3-5-haiku-20241022 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/claude/full | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=0 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
|
||
|
||
source slurm/env.sh | ||
|
@@ -23,6 +22,6 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config root-fc --model-class claude --large-model claude-3-5-sonnet-20241022 --small-model claude-3-5-haiku-20241022 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/claude/root-fc | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=0 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
|
||
|
||
source slurm/env.sh | ||
|
@@ -23,6 +22,6 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config baseline --model-class claude --large-model claude-3-5-sonnet-20241022 --small-model claude-3-5-haiku-20241022 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/claude/baseline | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=0 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
|
||
|
||
source slurm/env.sh | ||
|
@@ -23,6 +22,6 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config small-leaf --model-class claude --large-model claude-3-5-sonnet-20241022 --small-model claude-3-5-haiku-20241022 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/claude/small-leaf | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=0 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
|
||
|
||
source slurm/env.sh | ||
|
@@ -23,6 +22,6 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config small-all --model-class claude --large-model claude-3-5-sonnet-20241022 --small-model claude-3-5-haiku-20241022 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/claude/small-all | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=0 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
|
||
|
||
source slurm/env.sh | ||
|
@@ -23,6 +22,6 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config small-baseline --model-class claude --large-model claude-3-5-sonnet-20241022 --small-model claude-3-5-haiku-20241022 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/claude/small-baseline | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=0 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
|
||
|
||
source slurm/env.sh | ||
|
@@ -23,6 +22,6 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config short-context --model-class claude --large-model claude-3-5-sonnet-20241022 --small-model claude-3-5-haiku-20241022 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/claude/short-context | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=0 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
|
||
|
||
source slurm/env.sh | ||
|
@@ -23,6 +22,6 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config short-baseline --model-class claude --large-model claude-3-5-sonnet-20241022 --small-model claude-3-5-haiku-20241022 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/claude/short-baseline | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=0 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
|
||
|
||
source slurm/env.sh | ||
|
@@ -23,27 +22,27 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config full --model-class claude --large-model claude-3-5-sonnet-20241022 --small-model claude-3-5-haiku-20241022 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/claude/full | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config root-fc --model-class claude --large-model claude-3-5-sonnet-20241022 --small-model claude-3-5-haiku-20241022 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/claude/root-fc | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config baseline --model-class claude --large-model claude-3-5-sonnet-20241022 --small-model claude-3-5-haiku-20241022 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/claude/baseline | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config small-leaf --model-class claude --large-model claude-3-5-sonnet-20241022 --small-model claude-3-5-haiku-20241022 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/claude/small-leaf | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config small-all --model-class claude --large-model claude-3-5-sonnet-20241022 --small-model claude-3-5-haiku-20241022 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/claude/small-all | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config small-baseline --model-class claude --large-model claude-3-5-sonnet-20241022 --small-model claude-3-5-haiku-20241022 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/claude/small-baseline | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config short-context --model-class claude --large-model claude-3-5-sonnet-20241022 --small-model claude-3-5-haiku-20241022 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/claude/short-context | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config short-baseline --model-class claude --large-model claude-3-5-sonnet-20241022 --small-model claude-3-5-haiku-20241022 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/claude/short-baseline | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=8 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
#SBATCH --constraint=48GBgpu | ||
|
||
source slurm/env.sh | ||
|
@@ -23,6 +22,6 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config full --model-class cohere-hf --large-model CohereForAI/c4ai-command-r-plus-08-2024 --small-model CohereForAI/c4ai-command-r-08-2024 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/cohere-hf/full --engine-timeout 1800 | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=8 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
#SBATCH --constraint=48GBgpu | ||
|
||
source slurm/env.sh | ||
|
@@ -23,6 +22,6 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config root-fc --model-class cohere-hf --large-model CohereForAI/c4ai-command-r-plus-08-2024 --small-model CohereForAI/c4ai-command-r-08-2024 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/cohere-hf/root-fc --engine-timeout 1800 | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=8 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
#SBATCH --constraint=48GBgpu | ||
|
||
source slurm/env.sh | ||
|
@@ -23,6 +22,6 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config baseline --model-class cohere-hf --large-model CohereForAI/c4ai-command-r-plus-08-2024 --small-model CohereForAI/c4ai-command-r-08-2024 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/cohere-hf/baseline --engine-timeout 1800 | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=8 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
#SBATCH --constraint=48GBgpu | ||
|
||
source slurm/env.sh | ||
|
@@ -23,6 +22,6 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config small-leaf --model-class cohere-hf --large-model CohereForAI/c4ai-command-r-plus-08-2024 --small-model CohereForAI/c4ai-command-r-08-2024 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/cohere-hf/small-leaf --engine-timeout 1800 | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=8 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
#SBATCH --constraint=48GBgpu | ||
|
||
source slurm/env.sh | ||
|
@@ -23,6 +22,6 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config small-all --model-class cohere-hf --large-model CohereForAI/c4ai-command-r-plus-08-2024 --small-model CohereForAI/c4ai-command-r-08-2024 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/cohere-hf/small-all --engine-timeout 1800 | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=8 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
#SBATCH --constraint=48GBgpu | ||
|
||
source slurm/env.sh | ||
|
@@ -23,6 +22,6 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config small-baseline --model-class cohere-hf --large-model CohereForAI/c4ai-command-r-plus-08-2024 --small-model CohereForAI/c4ai-command-r-08-2024 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/cohere-hf/small-baseline --engine-timeout 1800 | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=8 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
#SBATCH --constraint=48GBgpu | ||
|
||
source slurm/env.sh | ||
|
@@ -23,6 +22,6 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config short-context --model-class cohere-hf --large-model CohereForAI/c4ai-command-r-plus-08-2024 --small-model CohereForAI/c4ai-command-r-08-2024 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/cohere-hf/short-context --engine-timeout 1800 | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=8 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
#SBATCH --constraint=48GBgpu | ||
|
||
source slurm/env.sh | ||
|
@@ -23,6 +22,6 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config short-baseline --model-class cohere-hf --large-model CohereForAI/c4ai-command-r-plus-08-2024 --small-model CohereForAI/c4ai-command-r-08-2024 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/cohere-hf/short-baseline --engine-timeout 1800 | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=8 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
#SBATCH --constraint=48GBgpu | ||
|
||
source slurm/env.sh | ||
|
@@ -23,27 +22,27 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config full --model-class cohere-hf --large-model CohereForAI/c4ai-command-r-plus-08-2024 --small-model CohereForAI/c4ai-command-r-08-2024 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/cohere-hf/full --engine-timeout 1800 | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config root-fc --model-class cohere-hf --large-model CohereForAI/c4ai-command-r-plus-08-2024 --small-model CohereForAI/c4ai-command-r-08-2024 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/cohere-hf/root-fc --engine-timeout 1800 | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config baseline --model-class cohere-hf --large-model CohereForAI/c4ai-command-r-plus-08-2024 --small-model CohereForAI/c4ai-command-r-08-2024 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/cohere-hf/baseline --engine-timeout 1800 | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config small-leaf --model-class cohere-hf --large-model CohereForAI/c4ai-command-r-plus-08-2024 --small-model CohereForAI/c4ai-command-r-08-2024 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/cohere-hf/small-leaf --engine-timeout 1800 | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config small-all --model-class cohere-hf --large-model CohereForAI/c4ai-command-r-plus-08-2024 --small-model CohereForAI/c4ai-command-r-08-2024 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/cohere-hf/small-all --engine-timeout 1800 | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config small-baseline --model-class cohere-hf --large-model CohereForAI/c4ai-command-r-plus-08-2024 --small-model CohereForAI/c4ai-command-r-08-2024 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/cohere-hf/small-baseline --engine-timeout 1800 | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config short-context --model-class cohere-hf --large-model CohereForAI/c4ai-command-r-plus-08-2024 --small-model CohereForAI/c4ai-command-r-08-2024 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/cohere-hf/short-context --engine-timeout 1800 | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config short-baseline --model-class cohere-hf --large-model CohereForAI/c4ai-command-r-plus-08-2024 --small-model CohereForAI/c4ai-command-r-08-2024 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/cohere-hf/short-baseline --engine-timeout 1800 | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -11,7 +11,6 @@ | |
#SBATCH --gpus=8 | ||
#SBATCH [email protected] | ||
#SBATCH --mail-type=END,FAIL | ||
#SBATCH --nodelist=nlpgpu04,nlpgpu05,nlpgpu08 | ||
#SBATCH --constraint=48GBgpu | ||
|
||
source slurm/env.sh | ||
|
@@ -23,6 +22,6 @@ source slurm/webarena-env.sh | |
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
python bench_webarena.py --config full --model-class mistral --large-model mistralai/Mistral-Large-Instruct-2407 --small-model mistralai/Mistral-Small-Instruct-2409 --save-dir /nlpgpu/data/andrz/redel/experiments/webarena/mistral/full --engine-timeout 1800 | ||
curl -X GET ${RESTART_URL} | ||
bash slurm/webarena-startup.sh | ||
sleep 600 | ||
kill $DOCKER_PID |
Oops, something went wrong.