Pagine più lunghe
Vengono mostrati sotto 50 risultati dal 151 al 200.
- (cron) On the difficulty of training recurrent neural networks [1 488 byte]
- (cron) Masked-Language-Modeling (MLM) [1 476 byte]
- (cron) ValueNet: A New Dataset for Human Value Driven Dialogue System [1 464 byte]
- (cron) Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap [1 455 byte]
- (cron) MiDaS [1 434 byte]
- (cron) Are We Done with MMLU? [1 424 byte]
- (cron) Modello linguistico di grandi dimensioni [1 416 byte]
- (cron) ImageNet Large Scale Visual Recognition Challenge [1 415 byte]
- (cron) Hallucinating Faces [1 409 byte]
- (cron) Dall-e 3 (2023) [1 390 byte]
- (cron) Contatti [1 387 byte]
- (cron) Negative log-likelihood [1 386 byte]
- (cron) Massive Multitask Language Understanding [1 375 byte]
- (cron) Taming Transformers for High-Resolution Image Synthesis [1 371 byte]
- (cron) Why think step by step? Reasoning emerges from the locality of experience (2023) [1 371 byte]
- (cron) InstructGPT: Training Language Models to Follow Instructions with Human Feedback [1 363 byte]
- (cron) SentencePiece [1 360 byte]
- (cron) Extras [1 348 byte]
- (cron) Google Model Garden [1 327 byte]
- (cron) Geoffrey Hinton [1 324 byte]
- (cron) Gorilla OpenFunctions [1 324 byte]
- (cron) Crawling the Internal Knowledge-Base of Language Models [1 316 byte]
- (cron) Finding Structure in Time [1 313 byte]
- (cron) Few-shot learning [1 309 byte]
- (cron) Step by Step [1 288 byte]
- (cron) Very deep convolutional networks for large-scale image recognition [1 254 byte]
- (cron) Deep Reinforcement Learning from Human Preferences [1 253 byte]
- (cron) Deep Residual Learning for Image Recognition [1 251 byte]
- (cron) An algorithm for suffix stripping [1 248 byte]
- (cron) Generating Sequences With Recurrent Neural Networks (2014) [1 235 byte]
- (cron) VGG16 (ConvNet) [1 234 byte]
- (cron) An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction (CLINC150) [1 222 byte]
- (cron) Reasoning and Acting (prompting) [1 220 byte]
- (cron) Bilingual Evaluation Understudy (BLEU) [1 216 byte]
- (cron) Extended Long Short-Term Memory [1 211 byte]
- (cron) Deep Convolutional Neural Networks (AlexNet) [1 211 byte]
- (cron) ControlNet [1 191 byte]
- (cron) Yoshua Bengio [1 190 byte]
- (cron) Modello Generativo [1 187 byte]
- (cron) LAION-5B [1 187 byte]
- (cron) One-hot encodings [1 186 byte]
- (cron) Funzione di attivazione [1 179 byte]
- (cron) A Large-Scale Document-Level Relation Extraction Dataset [1 172 byte]
- (cron) Zero 1-to-3 [1 165 byte]
- (cron) Cross-lingual Transfer Evaluation of Multilingual Encoders (XTREME) [1 151 byte]
- (cron) AlexNet [1 139 byte]
- (cron) In Search of Needles in a 11M Haystack:Recurrent Memory Finds What LLMs Miss [1 120 byte]
- (cron) How to Fine Tune Bert for Sequence Classification? [1 116 byte]
- (cron) Large Language Models as Zero-shot Dialogue State Tracker through Function Calling (16/02/2024) [1 105 byte]
- (cron) PaLM [1 091 byte]