✨CC30k: A Citation Contexts Dataset for Reproducibility-Oriented Sentiment Analysis
📝 Summary:
CC30k is a new dataset of 30,000 machine learning paper citation contexts, labeled with reproducibility-oriented sentiments. It enables large language models to better predict paper reproducibility, filling a crucial gap in computational reproducibility studies.
🔹 Publication Date: Published on Nov 11
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.07790
• PDF: https://arxiv.org/pdf/2511.07790
✨ Datasets citing this paper:
• https://huggingface.co/datasets/rochanaro/CC30k
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#MachineLearning #Reproducibility #LLM #SentimentAnalysis #DataScience
📝 Summary:
CC30k is a new dataset of 30,000 machine learning paper citation contexts, labeled with reproducibility-oriented sentiments. It enables large language models to better predict paper reproducibility, filling a crucial gap in computational reproducibility studies.
🔹 Publication Date: Published on Nov 11
🔹 Paper Links:
• arXiv Page: https://arxiv.org/abs/2511.07790
• PDF: https://arxiv.org/pdf/2511.07790
✨ Datasets citing this paper:
• https://huggingface.co/datasets/rochanaro/CC30k
==================================
For more data science resources:
✓ https://t.iss.one/DataScienceT
#MachineLearning #Reproducibility #LLM #SentimentAnalysis #DataScience
❤1