metadata

library_name: transformers
license: mit
language:
  - en
metrics:
  - f1
  - precision
  - recall
base_model:
  - gpt2-medium
pipeline_tag: text-classification

GPT-2 medium for classifying API reviews

This model classifies API reviews from developer forums (e.g., Stack Overflow) into one of eleven categories: 'usability', 'others', 'onlysentiment', 'bug', 'performance', 'community', 'documentation', 'compatibility', 'legal', 'portability', or 'security'.
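
The following is a minimal usage sketch, not an official snippet from the authors. It assumes the checkpoint loads through the standard transformers text-classification pipeline, as the `pipeline_tag` and `library_name` metadata suggest. The `model_id` argument is a placeholder for the Hub repository id or local path of this checkpoint, which is not stated here, and the helper names `top_category` and `classify` are hypothetical. The exact label strings returned by the model depend on its config and may differ from the category names above.

```python
# Categories listed in this model card. Whether the model's config (id2label)
# uses exactly these strings is an assumption; inspect the output to confirm.
CATEGORIES = {
    "usability", "others", "onlysentiment", "bug", "performance", "community",
    "documentation", "compatibility", "legal", "portability", "security",
}


def top_category(predictions):
    """Return the label with the highest score from a list of
    {'label': ..., 'score': ...} dicts (the pipeline's per-text output)."""
    best = max(predictions, key=lambda p: p["score"])
    return best["label"]


def classify(texts, model_id):
    """Classify a list of API review texts.

    model_id is a placeholder: pass the Hub repository id or a local path
    of this fine-tuned checkpoint.
    """
    # Imported lazily so the helper above can be used without transformers.
    from transformers import pipeline

    # top_k=None asks the pipeline to return scores for every label.
    classifier = pipeline("text-classification", model=model_id, top_k=None)
    return [top_category(scores) for scores in classifier(texts)]
```

A call such as `classify(["The docs for this endpoint are outdated and misleading."], model_id=...)` would return one predicted label per input text once a valid `model_id` is supplied.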

Developed by: Fabian C. Peña, Steffen Herbold
Finetuned from: gpt2-medium
Replication kit: https://github.com/aieng-lab/senlp-benchmark
Language: English
License: MIT

Citation

@misc{pena2025benchmark,
  author    = {Fabian Peña and Steffen Herbold},
  title     = {Evaluating Large Language Models on Non-Code Software Engineering Tasks},
  year      = {2025}
}