Pagina principale

Da Wiki AI.

Aiuto

Architetture

 NomeIngleseSiglaAnnoDiCreazione
AutoencoderAutoencoder1993
BARTBidirectional and Auto-Regressive TransformersBART29 ottobre 2019
Contrastive Language-Image Pretraining (CLIP)Contrastive Language-Image PretrainingCLIP2021
ControlNetControlNetControlNetfebbraio 2023
Detection TransformerDetection TransformerDeTr2020
Extended Long Short-Term MemoryExtended Long Short-Term MemoryxLSTM2024
Long Short-Term Memory (LSTM)Long Short-Term MemoryLSTM1997
Macchine di Boltzmann Restrittive (RBM)Restricted Boltzmann MachineRBM1986
MiDaSMulti-scale Deep StereoMiDaS2019
Mixture of Experts
Modello di Diffusione Latente (LDM)Latent Diffusion ModelLDM2021
Modello linguistico di grandi dimensioni per il linguaggio parlato
Rete Generativa AvversariaGenerative Adversarial NetworkGAN2014
Rete Neurale Artificiale (ANN)Artificial Neural NetworkANN1957
Rete Neurale Feed-Forward (FNN)Feed-Forward Neural NetworkFNN1958
Rete Neurale Residua (ResNet)Residual Neural NetworkResNet2015
Rete Neurale Ricorrente (RNN)Recurrent Neural NetworksRNN1990
Reti Neurali Convoluzionali (CNN)Convolutional Neural NetworksCNN1995
Sequence to Sequence (seq2seq)Sequence to Sequence Modelseq2seq2014
Transformer (Architettura di Deep Learning)Transformer2017
Vision Transformer (ViT)Vision TransformerViT2021

Modelli

 SiglaAnnoDiCreazioneVersioneCorrenteBasatoSu
AlexNet2012
AlpacaAlpaca2023
AlphaFold
BERTBERT2018
Biaxial LSTM (DeepJ - musica)Biaxial LSTMLong Short-Term Memory (LSTM)
ConceptNet
Contriever
DeepDream18 giugno 2015
GLoVeGLoVe2014GLoVe v.1.2 (2015)
Generative Pretrained Transformer (GPT)GPT2018GPT-4o (2024)
GoogLeNet
Gorilla OpenFunctions2023Architettura dei Large Language Models (LLM)
InstructGPTInstructGPT27 gennaio 2022
InstructPix2Pix: Learning to Follow Image Editing Instructions2023
LeNet1995LeNet-5
LlamaLLaMA20213.0
MistralMistral2023
NETtalk1993
O1
OpenAI o1
PaLMPaLM2022PaLM 2 (2023)
SPLADE
Stable Diffusion2022SD3 (2023)
StyleGANStyleGAN2019StyleGAN 3 (2021)
VGG16 (ConvNet)VGG162014
XLM-RoBERTaXLM-RoBERTa2020
Zero 1-to-32023

Benchmark

 NomeSiglaAnnoDiCreazione
An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction (CLINC150)An Evaluation Dataset for Intent Classification and Out-of-Scope PredictionCLINC1502019
BABILongBABILong2024
Bilingual Evaluation Understudy (BLEU)BiLingual Evaluation UnderstudyBLEU2002
BoolQBoolean QuestionsBoolQ2019
Cross-lingual Transfer Evaluation of Multilingual Encoders (XTREME)Cross-lingual Transfer Evaluation of Multilingual EncodersXTREME2020
Discrete Reasoning Over Paragraphs (DROP)DROPDROP2019
DocRED: A Large-Scale Document-Level Relation Extraction DatasetA Large-Scale Document-Level Relation Extraction DatasetDocRED2019
GSM8KGrade School Math 8KGSM8K2022
General Language Understanding Evaluation (GLUE)General Language Understanding EvaluationGLUE2018
HMDB: a large human motion databaseA large human motion databaseHMDB2011
HellaSwagHellaSwagHellaSwag2019
HumanEvalHumanEval2021
ImageNet Large Scale Visual Recognition ChallengeImageNet Large Scale Visual Recognition ChallengeILSVRC2012
LAION-5BLarge-scale Artificial Intelligence Open Network-5 BillionLAION-5B2021
LongAlignLongAlign2024
MATHMATH2021
MBPPMostly Basic Programming ProblemsMBPP2021
MMLUMassive Multitask Language UnderstandingMMLU2021
MS COCOMicrosoft Common Objects in ContextMS COCO2014
Microsoft Machine Reading Comprehension (MS MARCO)
Mind2WebMind2WebMind2Web2023
NaturalQuestionsNaturalQuestions2019
QuACQuestion Answering in ContextQuAC2018
SQuADStanford Question Answering DatasetSQuAD2018
Schema di Winograd
Semantic Textual Similarity (STS)Semantic Textual SimilaritySTS2012
UCF101 - Action Recognition Data SetAction Recognition Data SetUCF1012013
WinoGrandeWinoGrande2019

Paper

Categoria paper non trovata

NLP (Natural Language Processing)

Musica

MIDI

Concetti

Matematica

Apprendimento

Esecuzione e Inferenza

  • Quantizzazione: Riduzione della precisione dei numeri per accelerare l'esecuzione dei modelli.
  • Metodi di Decoding: Tecniche per generare output dai modelli di linguaggio.

Benchmarking

Ragionamento negli LLM (Large Language Models)

Dataset

Benchmarks

Modelli di Linguaggio

Benchmark Aggregati

Capacità di Ragionamento

  • HellaSwag
  • DROP
  • WinoGrande - Sakaguchi et al., 2021
  • Arc C
  • PIQA - Bisk et al., 2020
  • SIQA - Sap et al., 2019
  • CommonsenseQA - Talmor et al., 2018

Conoscenza

Codice

Comprensione del Testo

  • SQuAD - Rajpurkar et al., 2018
  • QuAC - Choi et al., 2018
  • BoolQ - Clark et al., 2019)
  • LongAlign - Yushi Bai et al., 2024
  • BABILong - Yuri Kuratov et al., 2024

Matematica

  • GSM8K - Cobbe et al., 2021
  • MATH - Hendrycks et al., 2021

Embeddings

  • MIRACL
  • MTEB

Video

Servizi Cloud

Google

Tutorial AI Lab

PyTorch