A couple of years ago we developed the PostTagger - an internal tool that helps to classify text data using LLM by sending a request for each text in the dataset. It was written in full Python (somehow) with the Dash framework, which allows to build dashboards without JS. However, it works with simple dashboards only, and with more and more features, code, performance, and visuals became a bit of a mess.
The goal of this internship is to implement a backend for a new version of the app that will allow a better-looking and snappier frontend. Since each run of the tool takes a lot of time, the main challenge of the project is to implement a robust progress system. We should allow users to close the app and still calculate the result, update the progress bar at the frontend, and continue the run if something goes wrong. Probably some combination of Celery and WebSockets will be used, but it may change depending on what might be the optimal solution.
This intersnhip will give you an experince of writing production-quality code, communicating with the whole team (albeit a small one) and learning some popular frameworks along the way.
Python
Docker
SQL
English
A nice bonus would be an experience with developing a web backend