Vai al contenuto

Pagine più lunghe

Vengono mostrati sotto 50 risultati dal 151 al 200.

(cron) ‎On the difficulty of training recurrent neural networks ‎[1 488 byte]
(cron) ‎Masked-Language-Modeling (MLM) ‎[1 476 byte]
(cron) ‎ValueNet: A New Dataset for Human Value Driven Dialogue System ‎[1 464 byte]
(cron) ‎Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap ‎[1 455 byte]
(cron) ‎MiDaS ‎[1 434 byte]
(cron) ‎Are We Done with MMLU? ‎[1 424 byte]
(cron) ‎Modello linguistico di grandi dimensioni ‎[1 416 byte]
(cron) ‎ImageNet Large Scale Visual Recognition Challenge ‎[1 415 byte]
(cron) ‎Hallucinating Faces ‎[1 409 byte]
(cron) ‎Dall-e 3 (2023) ‎[1 390 byte]
(cron) ‎Contatti ‎[1 387 byte]
(cron) ‎Negative log-likelihood ‎[1 386 byte]
(cron) ‎Massive Multitask Language Understanding ‎[1 375 byte]
(cron) ‎Taming Transformers for High-Resolution Image Synthesis ‎[1 371 byte]
(cron) ‎Why think step by step? Reasoning emerges from the locality of experience (2023) ‎[1 371 byte]
(cron) ‎InstructGPT: Training Language Models to Follow Instructions with Human Feedback ‎[1 363 byte]
(cron) ‎SentencePiece ‎[1 360 byte]
(cron) ‎Extras ‎[1 348 byte]
(cron) ‎Google Model Garden ‎[1 327 byte]
(cron) ‎Geoffrey Hinton ‎[1 324 byte]
(cron) ‎Gorilla OpenFunctions ‎[1 324 byte]
(cron) ‎Crawling the Internal Knowledge-Base of Language Models ‎[1 316 byte]
(cron) ‎Finding Structure in Time ‎[1 313 byte]
(cron) ‎Few-shot learning ‎[1 309 byte]
(cron) ‎Step by Step ‎[1 288 byte]
(cron) ‎Very deep convolutional networks for large-scale image recognition ‎[1 254 byte]
(cron) ‎Deep Reinforcement Learning from Human Preferences ‎[1 253 byte]
(cron) ‎Deep Residual Learning for Image Recognition ‎[1 251 byte]
(cron) ‎An algorithm for suffix stripping ‎[1 248 byte]
(cron) ‎Generating Sequences With Recurrent Neural Networks (2014) ‎[1 235 byte]
(cron) ‎VGG16 (ConvNet) ‎[1 234 byte]
(cron) ‎Reasoning and Acting (prompting) ‎[1 220 byte]
(cron) ‎Bilingual Evaluation Understudy (BLEU) ‎[1 216 byte]
(cron) ‎Extended Long Short-Term Memory ‎[1 211 byte]
(cron) ‎Deep Convolutional Neural Networks (AlexNet) ‎[1 211 byte]
(cron) ‎ControlNet ‎[1 191 byte]
(cron) ‎Yoshua Bengio ‎[1 190 byte]
(cron) ‎Modello Generativo ‎[1 187 byte]
(cron) ‎LAION-5B ‎[1 187 byte]
(cron) ‎One-hot encodings ‎[1 186 byte]
(cron) ‎Funzione di attivazione ‎[1 179 byte]
(cron) ‎A Large-Scale Document-Level Relation Extraction Dataset ‎[1 172 byte]
(cron) ‎Zero 1-to-3 ‎[1 165 byte]
(cron) ‎An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction (CLINC150) ‎[1 162 byte]
(cron) ‎Cross-lingual Transfer Evaluation of Multilingual Encoders (XTREME) ‎[1 151 byte]
(cron) ‎AlexNet ‎[1 139 byte]
(cron) ‎In Search of Needles in a 11M Haystack:Recurrent Memory Finds What LLMs Miss ‎[1 120 byte]
(cron) ‎How to Fine Tune Bert for Sequence Classification? ‎[1 116 byte]
(cron) ‎Large Language Models as Zero-shot Dialogue State Tracker through Function Calling (16/02/2024) ‎[1 105 byte]
(cron) ‎PaLM ‎[1 091 byte]

Estratto da "https://wiki.mindmaker.it/index.php/Speciale:PaginePiùLunghe"