Patent NLP - a cfcooney Collection

cfcooney 's Collections

Patent NLP

updated Feb 17

A collection of machine learning and natural language processing papers related to the patent domain.

Upvote

Patent Language Model Pretraining with ModernBERT

Paper • 2509.14926 • Published Sep 18, 2025 • 1
Fine-Tuning Causal LLMs for Text Classification: Embedding-Based vs. Instruction-Based Approaches

Paper • 2512.12677 • Published Dec 14, 2025 • 1
PatenTEB: A Comprehensive Benchmark and Model Family for Patent Text Embedding

Paper • 2510.22264 • Published Oct 25, 2025 • 2
AutoPatent: A Multi-Agent Framework for Automatic Patent Generation

Paper • 2412.09796 • Published Dec 13, 2024 • 2
PaECTER: Patent-level Representation Learning using Citation-informed Transformers

Paper • 2402.19411 • Published Feb 29, 2024 • 2
DAPFAM: A Domain-Aware Family-level Dataset to benchmark cross domain patent retrieval

Paper • 2506.22141 • Published Jun 27, 2025 • 2
Pap2Pat: Benchmarking Outline-Guided Long-Text Patent Generation with Patent-Paper Pairs

Paper • 2410.07009 • Published Oct 9, 2024 • 1
The Harvard USPTO Patent Dataset: A Large-Scale, Well-Structured, and Multi-Purpose Corpus of Patent Applications

Paper • 2207.04043 • Published Jul 8, 2022
Patent Representation Learning via Self-supervision

Paper • 2511.10657 • Published Nov 3, 2025
Intelligent System for Automated Molecular Patent Infringement Assessment

Paper • 2412.07819 • Published Dec 10, 2024 • 2
PatentBERT: Patent Classification with Fine-Tuning a pre-trained BERT Model

Paper • 1906.02124 • Published May 14, 2019
From scratch to silver: Creating trustworthy training data for patent-SDG classification using Large Language Models

Paper • 2509.09303 • Published Sep 11, 2025

Upvote