Category:Publication
This category uses the form modulo_pubblicazione.
Pages in category "Publication"
The following 119 pages are in this category, out of 119 total.
A
- A Comprehensive Overview of Large Language Models
- A Few Brief Notes on DeepImpact, COIL, and a Conceptual Framework for Information Retrieval Techniques
- A Focused Backpropagation Algorithm for Temporal Pattern Recognition
- A Large-Scale Document-Level Relation Extraction Dataset
- A Neural Algorithm of Artistic Style (2015)
- A Neural Probabilistic Language Model
- A Theory for Emergence of Complex Skills in Language Models (2023)
- Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models
- An algorithm for suffix stripping
- Are Large Language Models Geospatially Knowledgeable?
- Are Sixteen Heads Really Better than One?
- Are We Done with MMLU?
- Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models
- Attention Is All You Need (2017)
- Automatic Stylistic Composition of Bach Chorales With Deep LSTM (2017)
B
- BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
- BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models
- BERT for Classification: Beyond the Next Sentence Prediction Task
- BERT Rediscovers the Classical NLP Pipeline
- BLEU: a method for automatic evaluation of machine translation
C
- Chain of Thought Prompting Elicits Reasoning in Large Language Models
- Classifier-Free Diffusion Guidance
- COIL: Revisit exact lexical match in information retrieval with contextualized inverted list
- Context-Aware Term Weighting For First Stage Passage Retrieval
- Convolutional Neural Networks for Sentence Classification
- Crawling the Internal Knowledge-Base of Language Models
D
- Dall-e 3 (2023)
- Decoding Intelligence: A Framework for Certifying Knowledge Comprehension in LLMs
- Deep Contextualized Word Representations
- Deep Convolutional Neural Networks (AlexNet)
- Deep Reinforcement Learning from Human Preferences
- Deep Residual Learning for Image Recognition
- Deep Unsupervised Learning using Nonequilibrium Thermodynamics
- DeepJ: Style-Specific Music Generation (2018)
- Dense Passage Retrieval for Open-Domain Question Answering
- Diffusion Models Beat GANs on Image Synthesis
E
F
G
- Generating Sequences With Recurrent Neural Networks
- Generating Sequences With Recurrent Neural Networks (2014)
- Generative Adversarial Nets
- GloVe: Global Vectors for Word Representation
- Going Deeper with Convolutions
- Gorilla: Large Language Model Connected with Massive APIs
- GPT-4 Technical Report
- Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets
- Guiding Attention with Evidence for Improving Document-Level Relation Extraction (DREEAM)
H
- Hallucinating Faces
- Harmonizing Music the Boltzmann Way (1994)
- HellaSwag: Can a Machine Really Finish Your Sentence?
- Hierarchical Attention Networks for Document Classification
- High-Resolution Image Synthesis with Latent Diffusion Models
- Highly accurate protein structure prediction with AlphaFold
- How Many Data Points is a Prompt Worth?
- How to Fine Tune Bert for Sequence Classification?
I
L
- LAION-5B: An open large-scale dataset for training next generation image-text models
- Language Models are Few-Shot Learners
- Large Language Models as Zero-shot Dialogue State Tracker through Function Calling (16/02/2024)
- Learning long-term dependencies with gradient descent is difficult
- Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation
- Learning Transferable Visual Models From Natural Language Supervision
- LLaMA: Open and Efficient Foundation Language Models
- Long Short-Term Memory
- LongAlign: A Recipe for Long Context Alignment of Large Language Models
M
- M3-Embedding: Multi-Linguality, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation
- MarbleNet: Deep 1D Time-Channel Separable Convolutional Neural Network for Voice Activity Detection
- Massive Multitask Language Understanding
- MemGPT: Towards LLMs as Operating Systems
- MiDaS v3.1 – A Model Zoo for Robust Monocular Relative Depth Estimation
N
P
- Papers-in-100-Lines-of-Code
- Physics in Next-token Prediction
- Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions
- Powers of 10: Modeling Complex information-seeking systems at multiple scales
- Pre-training of Deep Bidirectional Transformers for Language Understanding
- Prefix-tuning: Optimizing continuous prompts for generation
R
S
T
- Taming Transformers for High-Resolution Image Synthesis
- Technical Report, Palm 2
- The Natural Language Decathlon: Multitask Learning as Question Answering
- The perceptron: A probabilistic model for information storage and organization in the brain
- The Schema-Guided Dialogue Dataset
- The Theory of Stochastic Processes, with Particular Reference to Applications
- The Unreasonable Effectiveness of Recurrent Neural Networks
- Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer
- Training Compute-Optimal Large Language Models
- Training language models to follow instructions with human feedback