Welcome to the exciting world of arxiv-sanity-lite, an innovative platform that helps researchers stay updated with the latest papers in their field. In this guide, we will walk you through the steps to set up your own instance of this powerful tool. Let’s dive in!
Overview of arxiv-sanity-lite
arxiv-sanity-lite is a minimalistic tool that periodically polls the arXiv API to fetch new academic papers. It uses Support Vector Machines (SVM) over TF-IDF features extracted from paper abstracts to recommend new papers based on custom tags you create. With a friendly web UI, users can search, rank, sort, and slice data effortlessly. Don’t miss out on any developments in your area—enjoy daily email recommendations based on your tags!
Setting Up Your Local Environment
Running arxiv-sanity-lite locally requires a few simple steps. Below is a structured approach to get you started:
1. Update the Database with New Papers
To keep your database fresh, use the below script to periodically update it:
bash
#!/bin/bash
python3 arxiv_daemon.py --num 2000
if [ $? -eq 0 ]; then
echo New papers detected! Running compute.py
python3 compute.py
else
echo No new papers were added, skipping feature computation
fi
This script accomplishes two main tasks: it gathers new papers from the arXiv API and computes the necessary features for them.
2. Run the Flask Server
To serve the application, execute this command:
bash
export FLASK_APP=serve.py; flask run
Your entire database will be stored inside the data directory, ready for access!
3. Hosting on Linode
If you want to make your instance available on the internet, consider hosting it on a Linode server. A modest Nanode 1 GB instance is ideal for indexing about 30K papers at just $5 per month.
4. Set Up Daily Email Recommendations (Optional)
If you’d like your users to receive daily paper recommendations, you can leverage the send_emails.py script. Remember to install SendGrid with the following command:
bash
pip install sendgrid
You can then set this script to run daily as a cron job.
System Requirements
Before diving in, install the necessary packages with:
bash
pip install -r requirements.txt
Troubleshooting
If you encounter any issues while running your instance, consider the following troubleshooting tips:
- Ensure that your Python and Flask versions are compatible.
- Check your API keys and permissions if you face issues with fetching new papers.
- Inspect the logs for any errors in the database update or feature computation scripts.
For more insights, updates, or to collaborate on AI development projects, stay connected with fxis.ai.
Future Enhancements
The development team is constantly working on improvements. Here are some upcoming features:
- Making the website mobile-friendly using media queries in CSS.
- Optimizing the metas table for efficiency.
- Implementing a reverse index for faster searches.
At fxis.ai, we believe that such advancements are crucial for the future of AI, as they enable more comprehensive and effective solutions. Our team is continually exploring new methodologies to push the envelope in artificial intelligence, ensuring that our clients benefit from the latest technological innovations.
In Conclusion
With this guide, you have all the information needed to set up your own instance of arxiv-sanity-lite easily. Stay current with the academic tide, boost your research, and enjoy all the new developments in your field!

