Fine-Tuning Causal LLMs for Text Classification: Embedding-Based vs. Instruction-Based Approaches Paper • 2512.12677 • Published Dec 14, 2025 • 1
PatenTEB: A Comprehensive Benchmark and Model Family for Patent Text Embedding Paper • 2510.22264 • Published Oct 25, 2025 • 2
AutoPatent: A Multi-Agent Framework for Automatic Patent Generation Paper • 2412.09796 • Published Dec 13, 2024 • 2
PaECTER: Patent-level Representation Learning using Citation-informed Transformers Paper • 2402.19411 • Published Feb 29, 2024 • 2
DAPFAM: A Domain-Aware Family-level Dataset to benchmark cross domain patent retrieval Paper • 2506.22141 • Published Jun 27, 2025 • 2
Pap2Pat: Benchmarking Outline-Guided Long-Text Patent Generation with Patent-Paper Pairs Paper • 2410.07009 • Published Oct 9, 2024 • 1
The Harvard USPTO Patent Dataset: A Large-Scale, Well-Structured, and Multi-Purpose Corpus of Patent Applications Paper • 2207.04043 • Published Jul 8, 2022
Intelligent System for Automated Molecular Patent Infringement Assessment Paper • 2412.07819 • Published Dec 10, 2024 • 2
PatentBERT: Patent Classification with Fine-Tuning a pre-trained BERT Model Paper • 1906.02124 • Published May 14, 2019
From scratch to silver: Creating trustworthy training data for patent-SDG classification using Large Language Models Paper • 2509.09303 • Published Sep 11, 2025