QualityEval
📉
End-to-end evaluation of Python and Java code quality.
AI-generated code, secure code generation, software security, vulnerability detection, static analysis, exploit generation, data poisoning, robustness evaluation, semantic correctness checking, symbolic execution, trustworthy AI, open-source LLMs, reproducible AI evaluation, AI safety, adversarial testing, software engineering datasets, dependable systems, model auditing, secure inference pipelines.