Robust recipes to align language models with human and AI preferences

Python 4,973 427 Updated Nov 21, 2024

An Extensible Deep Learning Library

Python 1,937 287 Updated Feb 11, 2025

A Python + iCloud wrapper to access iPhone and Calendar data.

Python 59 24 Updated Jan 28, 2023
Python 13 Updated Apr 16, 2021

Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting has…

Python 664 90 Updated Feb 27, 2024

Jina examples and demos to help you get started

Python 457 142 Updated Nov 1, 2021

☁️ Build multimodal AI applications with cloud-native stack

Python 21,292 2,216 Updated Dec 20, 2024

An open-registry for hosting Jina executors via container images

Python 108 48 Updated Aug 31, 2021

PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

Python 1,292 224 Updated Mar 23, 2022

This repo supports various cross-lingual transfer learning & multilingual NLP models.

Python 92 6 Updated Sep 13, 2023

A Python framework for sequence labeling evaluation (named-entity recognition, POS tagging, etc.)

Python 1,107 130 Updated Aug 28, 2024

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 2,727 162 Updated Aug 18, 2024
Python 25 5 Updated Jan 22, 2024

A word2vec negative sampling implementation with correct CBOW update.

C++ 261 18 Updated Nov 8, 2021

Acceptance rates for the major AI conferences

Jupyter Notebook 4,365 308 Updated Jan 24, 2025

LibKGE - A knowledge graph embedding library for reproducible research

Python 797 128 Updated Apr 8, 2024

Getting interpretable dimensions in word embedding spaces.

Python 14 2 Updated Jul 6, 2023

Analyzing mBERT's multilinguality in a small laboratory setting

Python 13 2 Updated Jun 12, 2023

A list of selected resources, methods, and tools dedicated to Legal Text Analytics.

632 121 Updated Nov 5, 2024

Helper to create posts for Bayern Ticket Mitfahrer groups in Facebook.

HTML 2 Updated Apr 11, 2019

DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models

156 12 Updated Dec 6, 2022

Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper

Python 383 42 Updated Jun 23, 2024

Unsupervised text tokenizer focused on computational efficiency

C++ 965 103 Updated Mar 29, 2024

BERT-related papers

2,037 280 Updated Aug 12, 2023

Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)

Python 355 47 Updated Nov 7, 2023

Papers & presentation materials from Hugging Face's internal science day

2,046 118 Updated Oct 31, 2020

A Python framework for creating, editing, and invoking Noisy Intermediate-Scale Quantum (NISQ) circuits.

Python 4,432 1,064 Updated Feb 11, 2025

Language-Agnostic SEntence Representations

Jupyter Notebook 3,614 462 Updated May 2, 2024

A framework to learn cross-lingual word embedding mappings

Python 647 131 Updated Apr 22, 2023

👓 A web interface of gpustat: monitor GPU clusters at a glance

Python 323 38 Updated Jan 4, 2024