Interface for easier topic modelling.
-
Updated
Jul 29, 2024 - Python
Interface for easier topic modelling.
Python implementation of bag-of-concepts
Library in C++ and a python wrapper for dealing with Page XML files
An extremely weakly-supervised text classification method using mutually-enhancing text granularities (word, sentence, and document-level context).
Hyperbolic Contrastive Learning for Document Representations - A Multi-View Approach with Paragraph-level Similarities
Dataset and code for "Label-Wise Document Pre-Training for Multi-Label Text Classification" (NLPCC 2020)
Unsupervised Discovery Of Trends In Biomedical Research Based On The PubMed Baseline Repository
Document representations in the Vector Space Model using multiple weighting schemes.
Define models to represent a textual document, e.g. a PDF, preserving the hierarchy of the content.
Work done with a teammate as part of the graduate PLDAC course at Sorbonne University
Rethinking Graph-Based Document Classification.
Add a description, image, and links to the document-representation topic page so that developers can more easily learn about it.
To associate your repository with the document-representation topic, visit your repo's landing page and select "manage topics."