Skip to content

WASP-SYSTEMS/crs_exercise

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Task

Your task is to implement a simple CRS consisting of one LLM tool-calling agent. To make things easier all the challenge project interaction is already implemented, so you do not have to read/write files in the project yourself (see minimal_crs.py for details).

Your agent is only supposed to patch the vulnerability. So, you can assume a vulnerability was already found in the source code.

Requirements

Me, when I have to build yq from source: (ノ ゜Д゜)ノ ︵ ┻━┻

(Maybe you can find the right pre-build binaries at the release page.)

Setup

Please make sure that your user is added to the docker group and you do not have to execute docker using sudo, because the helper scripts we use call docker without sudo.

Execute the following commands in the root directory of this project:

Setup python environment:

python3 -m venv venv
source ./venv/bin/activate
pip install -r requirements.txt

Setup challenge project:

git clone -b crs_exercise --single-branch https://github.com/WASP-SYSTEMS/mock-cp
cd mock-cp
make cpsrc-prepare
make docker-config-local
make docker-build

To test your setup run python3 test_setup.py.

Export OpenAI API-key:

export OPENAI_API_KEY=<your key>

You will receive a (temporary) API key from us after you send a minimal (pseudocode) implementation to [email protected].

Now you are ready implement your agent in minimal_crs.py.

NOTE: The only files which are relevant to you are minimal_crs.py and public methods of the class AixccProject in project/project.py. You do not have to look at any other files! All source code interaction is done for you through the AixccProject class.

NOTE: Execute your the CRS by running python3 minimal_crs.py in the root directory of this project to prevent import errors.

Helpful links

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages