Welcome to our guide on installing and setting up the Spider2-V environment and interacting with multimodal agents for automating data science and engineering workflows. This installation process may seem daunting, but we’ll walk you through it step by step, making it user-friendly for everyone!
What is Spider2-V?
Spider2-V is a project designed to create automated solutions for data science and engineering tasks. Think of it like equipping a robot with the manuals to perform various tasks – it learns how to globally automate workflows effectively.

For additional resources, check out the following:
Installation Guide
This section explains how to install Spider2-V on your desktop or server.
Step 1: Set Up Your Environment
- Clone the repository:
git clone https://github.com/xlang-ai/Spider2-V
cd Spider2-V
conda create -n spider2v python=3.11
conda activate spider2v
pip install -r requirements.txt
Step 2: Install Virtualization Software
To run the virtual machine environment:
- Install VMware Workstation Pro (or VMware Fusion for Apple Chips).
- Append the vmrun command into your system path to ensure successful installation. Verify with:
vmrun -T ws list
Note: VirtualBox will also be supported soon if you run into issues with VMware.
Step 3: Install Additional Dependencies
Run the following command to set up Playwright:
playwright install chromium
Quick Start
After setting up the environment, you can run a minimal example to interact with it:
python
from desktop_env.envs.desktop_env import DesktopEnv
example_path = 'evaluation_examples/examples/dagster/example.json'
with open(example_path, 'r') as infile:
example = json.load(infile)
env = DesktopEnv(action_space=pyautogui)
obs = env.reset(task_config=example)
print(f'Task instruction: {example["instruction"]}')
obs, reward, done, info = env.step(pyautogui.rightClick())
input('Now, you can finish the task in the virtual machine manually and Press ENTER to evaluate ...')
score = env.evaluate()
print(f'Evaluation score: {float(score):.1f}')
env.close()
Think of this code as sending an operator into a factory. The operator has a checklist (task instruction), and interacts with machines (environment) to execute tasks. Once the operator clicks on a button (right click), they get feedback on how well they performed (evaluation score).
Troubleshooting
If you encounter issues during installation or execution:
- Ensure all versions and dependencies are correctly configured, especially Python (3.11).
- Double-check the installation of VMware or VirtualBox.
- If problems arise, revisit the installation documentation or the community forums for support.
For more insights, updates, or to collaborate on AI development projects, stay connected with fxis.ai.
Conclusion
At fxis.ai, we believe that such advancements are crucial for the future of AI, as they enable more comprehensive and effective solutions. Our team is continually exploring new methodologies to push the envelope in artificial intelligence, ensuring that our clients benefit from the latest technological innovations.