metadata
library_name: transformers
license: mit
language:
- en
metrics:
- f1
- precision
- recall
base_model:
- gpt2-medium
pipeline_tag: text-classification
GPT-2 medium for classifying API reviews
This model classifies API reviews from developer forums (e.g., Stack Overflow) into eleven categories: 'usability', 'others', 'onlysentiment', 'bug', 'performance', 'community', 'documentation', 'compatibility', 'legal', 'portability', and 'security'.
- Developed by: Fabian C. Peña, Steffen Herbold
- Finetuned from: gpt2-medium
- Replication kit: https://github.com/aieng-lab/senlp-benchmark
- Language: English
- License: MIT
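Usage

The following is a minimal sketch of how the model could be loaded with the `transformers` text-classification pipeline. This card does not state the model's Hub repository ID, so `MODEL_ID` is a placeholder to replace with the actual ID or a local checkpoint path. The helper names `load_classifier` and `classify` are illustrative and are not part of the replication kit. The sketch also assumes the checkpoint's configuration maps class indices to the category names listed above; if it does not, the pipeline returns generic names such as `LABEL_0`.

```python
from typing import Callable, Dict, List

# Placeholder (assumption): this card does not give the repository ID.
# Replace with the actual Hub ID or a local path to the fine-tuned checkpoint.
MODEL_ID = "path/to/this-model"


def load_classifier(model_id: str = MODEL_ID):
    """Build a text-classification pipeline for the given checkpoint.

    Requires the `transformers` package and a backend such as PyTorch.
    The import is done lazily so that `classify` can be used on its own.
    """
    from transformers import pipeline

    return pipeline("text-classification", model=model_id)


def classify(reviews: List[str], classifier: Callable[[List[str]], List[Dict]]) -> List[str]:
    """Return the top predicted label for each review.

    `classifier` is expected to behave like a text-classification pipeline:
    given a list of strings, it returns one {"label": ..., "score": ...}
    dict per input.
    """
    outputs = classifier(reviews)
    return [output["label"] for output in outputs]


# Example call (loads the checkpoint, so it is left commented out):
# clf = load_classifier("path/to/this-model")
# print(classify(["The documentation for this API is badly outdated."], clf))
```

By default the pipeline returns only the highest-scoring label per input. Passing `top_k=None` when calling the pipeline returns scores for all categories instead, in which case `classify` would need to be adapted to the nested output.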
Citation
@misc{pena2025benchmark,
author = {Fabian Peña and Steffen Herbold},
title = {Evaluating Large Language Models on Non-Code Software Engineering Tasks},
year = {2025}
}