Oldest pages

Showing 50 results below, from 201 to 250.

  1. Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets (10:10, 19 Aug 2024)
  2. Grokking (11:25, 19 Aug 2024)
  3. Are We Done with MMLU? (11:30, 19 Aug 2024)
  4. Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models (11:53, 19 Aug 2024)
  5. Confabulazione (12:00, 19 Aug 2024)
  6. Language Models are Few-Shot Learners (13:09, 19 Aug 2024)
  7. How Many Data Points is a Prompt Worth? (13:10, 19 Aug 2024)
  8. Prefix-Tuning (13:17, 19 Aug 2024)
  9. Prefix-tuning: Optimizing continuous prompts for generation (13:18, 19 Aug 2024)
  10. LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models (09:44, 22 Aug 2024)
  11. ConceptNet (20:36, 24 Aug 2024)
  12. What's in a Name? -- Gender Classification of Names with Character Based Machine Learning Models (20:45, 26 Aug 2024)
  13. End-to-End Object Detection with Transformers (11:53, 28 Aug 2024)
  14. Generating Sequences With Recurrent Neural Networks (2014) (11:54, 28 Aug 2024)
  15. Detection Transformer (11:56, 28 Aug 2024)
  16. Non-Maximum Suppression (12:43, 28 Aug 2024)
  17. YOLO-World: Real-Time Open-Vocabulary Object Detection (13:02, 28 Aug 2024)
  18. Judging LLM-as-a-judge with MT-Bench and Chatbot Arena (20:11, 28 Aug 2024)
  19. LLM-as-a-judge (20:12, 28 Aug 2024)
  20. Vocabolario (10:46, 5 Sep 2024)
  21. None (10:20, 6 Sep 2024)
  22. Deep Contextualized Word Representations (10:24, 6 Sep 2024)
  23. A Neural Probabilistic Language Model (10:27, 6 Sep 2024)
  24. Neural Machine Translation by Jointly Learning to Align and Translate (10:29, 6 Sep 2024)
  25. Hallucinating Faces (11:02, 6 Sep 2024)
  26. Allucinazione (11:11, 6 Sep 2024)
  27. Red Teaming (11:16, 6 Sep 2024)
  28. Chain of Thought Prompting Elicits Reasoning in Large Language Models (11:26, 6 Sep 2024)
  29. Survey of Hallucination in Natural Language Generation (11:29, 6 Sep 2024)
  30. Chain of Thought (11:30, 6 Sep 2024)
  31. Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap (12:00, 6 Sep 2024)
  32. Efficient Estimation of Word Representations in Vector Space (12:01, 6 Sep 2024)
  33. Going Deeper with Convolutions (12:10, 6 Sep 2024)
  34. Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions (13:44, 6 Sep 2024)
  35. Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank (18:27, 6 Sep 2024)
  36. Convolutional Neural Networks for Sentence Classification (18:54, 6 Sep 2024)
  37. LLaMA: Open and Efficient Foundation Language Models (19:02, 6 Sep 2024)
  38. Masked-Language-Modeling (MLM) (19:18, 6 Sep 2024)
  39. BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension (06:30, 8 Sep 2024)
  40. BART (06:32, 8 Sep 2024)
  41. Generative Pretrained Transformer (GPT) (06:32, 8 Sep 2024)
  42. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation (18:08, 8 Sep 2024)
  43. Forward pass (18:10, 8 Sep 2024)
  44. XLSTM: Extended Long Short-Term Memory (04:53, 9 Sep 2024)
  45. Long Short-Term Memory (LSTM) (04:55, 9 Sep 2024)
  46. Long Short-Term Memory (04:56, 9 Sep 2024)
  47. BLEU: a method for automatic evaluation of machine translation (05:01, 9 Sep 2024)
  48. Bilingual Evaluation Understudy (BLEU) (05:02, 9 Sep 2024)
  49. Obiettivo di pre-training (10:37, 10 Sep 2024)
  50. What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization? (10:40, 10 Sep 2024)
