Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.
-
Updated
Jan 30, 2023 - Python
Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.
Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models,corpus and leaderboard
Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
Mengzi Pretrained Models
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation
This repository contains code and datasets related to entity/knowledge papers from the VERT (Versatile Entity Recognition & disambiguation Toolkit) project, by the Knowledge Computing group at Microsoft Research Asia (MSRA).
Triple Branch BERT Siamese Network for fake news classification on LIAR-PLUS dataset in PyTorch
Awesome resources for in-context learning and prompt engineering: Mastery of the LLMs such as ChatGPT, GPT-3, and FlanT5, with up-to-date and cutting-edge updates.
Pre-training of Language Models for Language Understanding
a collection of NLP projects&tools. 自然语言处理方向项目和工具集合。
[eLife 2020] "Comprehension of computer code relies primarily on domain-general executive brain regions" by Anna A. Ivanova, Shashank Srikant, Yotaro Sueoka, Hope H. Kean, Riva Dhamala, Una-May O'Reilly, Marina U. Bers, Evelina Fedorenko
[NeurIPS 2022] "Convergent Representations of Computer Programs in Human and Artificial Neural Networks" by Shashank Srikant*, Benjamin Lipkin*, Anna A. Ivanova, Evelina Fedorenko, Una-May O'Reilly.
Dataset parsers from the SuperGLUE benchmark https://super.gluebenchmark.com/tasks/
Neural network model to measure semantic similarity between sentences.
The tok function is a JavaScript and Node.js function that processes object instances and tokenizes text arrays. It returns tokenized words number, tokenized words array, and tokenized words concatenated string. It's part of the open-source DropSuit NLP library under the Apache License 2.0.
iOS application for thesis. Mobile chatbot app for banking topics. Language understanding is provided with Microsoft LUIS and Google NLP. The project is for learning purposes
DropSuit - NLP & data manipulation library for JS & Node.js. Offers diverse functions for text analysis, language understanding & more. Open-source under Apache License 2.0.
Software AG Natural Explorer
Chatbot that use Conversational Language Understanding And Custom question answering
The enoun function is a JavaScript and Node.js function that is part of the DropSuit NLP library. It filters text to only include English nouns. It's open-source and available under the Apache License 2.0.
Add a description, image, and links to the language-understanding topic page so that developers can more easily learn about it.
To associate your repository with the language-understanding topic, visit your repo's landing page and select "manage topics."