VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-Tuning
Paper
•
2503.15438
•
Published
•
4
This repository provides the tokenizer used in the VenusFactory platform, described in VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-Tuning. VenusFactory is a unified platform for protein engineering that integrates data retrieval, standardized task benchmarking, and modular fine-tuning of protein language models (PLMs).
Code and further details are available at: https://github.com/tyang816/VenusFactory