https://summary-of-some-paper-in-cuda.readthedocs.io/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/agents/2302.04761-Toolformer-LanguageModelsCanTeachThemselvestoUseTools/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/agents/2501.06252-TRANSFORMER-SQUARED-SELF-ADAPTIVELLMS/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/agents/2501.12948-DeepSeek-R1-IncentivizingReasoningCapabilityinLLMsviaReinfor/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/agents/2506.20249-LanguageModelingbyLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/agents/2511.03773-ScalingAgentLearningviaExperienceSynthesis/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/agents/2511.15593-WhatDoesItTaketoBeaGoodAIResearchAgent%3FStudyingtheRoleofIdea/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/agents/2512.02551-CUDA-L2-Surpassing-cuBLAS-Performance-for-Matrix-Multiplication-through-Reinforc/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/agents/2512.08296-Towards-a-Science-of-Scaling-Agent-Systems/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/agents/2512.10398-Confucius-Code-Agent-Scalable-Agent-Scaffolding-for-Real-World-Codebases/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/agents/2512.12967-QwenLong-L1.5-Post-Training-Recipe-for-Long-Context-Reasoning-and-Memory-Managem/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/agents/2512.13564-Memory-in-the-Age-of-AI-Agents-A-SurveyForms-Functions-and-Dynamics/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/agents/2512.15176-Draft-with-Diffusion-Verify-with-Autoregressive-Models/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/agents/2512.20848-Nemotron-3-Nano-Open-Efficient-Mixture-of-Experts-Hybrid-Mamba-Transformer-Model/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/agents/2512.20856-NVIDIA-Nemotron-3-Efficient-and-Open-Intelligence/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/agents/2512.23236-KernelEvolve-Scaling-Agentic-Kernel-Coding-for-Heterogeneous-AI-Accelerators-at-/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/agents/2512.23676-Web-World-Models/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/agents/2603.11327-Meta-Reinforcement-Learning-with-Self-Reflection-for-Agentic-Search/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/agents/2604.08516-MolmoWeb-Open-Visual-Web-Agent-and-Open-Data-for-the-Open-Web/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/2409.02795-TowardsaUnifiedViewofPreferenceLearningforLargeLanguageModel/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/2501.07301-TheLessonsofDevelopingProcessRewardModelsinMathematicalReaso/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/2501.12599-KIMIK1.5-SCALINGREINFORCEMENTLEARNINGWITHLLMS/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/CamelsinaChangingClimate-EnhancingLMAdaptationwithT%C3%9CLU2/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/DeepReinforcementLearningfromHumanPreferences/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/DirectPreferenceOptimization-YourLanguageModelisSecretlyaRew/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/HERMES3TECHNICALREPORT/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/INSTRUCTIONTUNINGWITHGPT-4/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/LIMA-LessIsMoreforAlignment/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/LargeReasoningModelsLearnBetterAlignmentfromFlawedThinking/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/Learningtosummarizefromhumanfeedback/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/Let%E2%80%99sVerifyStepbyStep/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/Llama2-OpenFoundationandFine-TunedChatModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/MULTIPLAYERNASHPREFERENCEOPTIMIZATION/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/OnscalableoversightwithweakLLMsjudgingstrongLLMs/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/RLHFWorkflow-FromRewardModelingtoOnlineRLHF%E2%80%93AComprehensivePr/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/ReinforcementLearningfromHumanFeedback-AshortintroductiontoR/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/ScalingInstruction-FinetunedLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/Simplesyntheticdatareducessycophancyinlargelanguagemodels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/Traininglanguagemodelstofollowinstructionswithhumanfeedback/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/TruthRL-IncentivizingTruthfulLLMsviaReinforcementLearning/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/T%C3%BClu3-PushingFrontiersinOpenLanguageModelPost-Training/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/alignment/UFT-UnifyingFine-TuningofSFTandRLHF-DPO-UNAthroughaGeneraliz/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/2506.05200-Transformers-Meet-In-Context-Learning-A-Universal-Approximation-Theory/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/2603.02188-MULTI-HEAD-LOW-RANK-ATTENTION/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/ANIMAGEISWORTH16X16WORDS-TRANSFORMERSFORIMAGERECOGNITIONATSC/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/ASurveyonDiffusionLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/AttentionIsAllYouNeed/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/AutoregressiveUniversalVideoSegmentationModel/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/Character-levelConvolutionalNetworksforTextClassification/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/DIFFERENTIALTRANSFORMER/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/DIFFUSIONTRANSFORMERSWITHREPRESENTATIONAUTOENCODERS/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/DeepResidualLearningforImageRecognition/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/DeepSeekMoE-TowardsUltimateExpertSpecializationinMixture-of-/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/EXAONE4.0-UnifiedLargeLanguageModelsIntegratingNon-reasoning/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/EfficientNet-RethinkingModelScalingforConvolutionalNeuralNet/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/EveryAttentionMatters-AnEfficientHybridArchitectureforLong-C/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/FASTANDSIMPLEX-2-SIMPLICIALATTENTIONINTRI-TON/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/FLEXOLMO-OpenLanguageModelsforFlexibleDataUse/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/Falcon-H1-AFamilyofHybrid-HeadLanguageModelsRedefiningEffici/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/FastR-CNN/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/FullyConvolutionalNetworksforSemanticSegmentation/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/GPipe-EasyScalingwithMicro-BatchPipelineParallelism/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/Gemma3TechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/GeneratingLongSequenceswithSparseTransformers/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/HOWPOWERFULAREGRAPHNEURALNETWORKS%3F/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/Jamba-1.5-HybridTransformer-MambaModelsatScale/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/Jamba-AHybridTransformer-MambaLanguageModel/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/KIMILINEAR-ANEXPRESSIVE%2CEFFICIENTATTENTIONARCHITECTURE/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/LM2-LargeMemoryModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/LONGNET-ScalingTransformersto1%2C000%2C000%2C000Tokens/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/Llama-Nemotron-EfficientReasoningModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/Log-LinearAttention/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/Mamba-Linear-TimeSequenceModelingwithSelectiveStateSpaces/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/Megatron-LM-TrainingMulti-BillionParameterLanguageModelsUsin/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/MixtralofExperts/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/Mixture-of-Transformers-ASparseandScalableArchitectureforMul/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/MobileLLM-OptimizingSub-billionParameterLanguageModelsforOn-/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/MolmoAct-ActionReasoningModelsthatcanReasoninSpace/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/Multi-TokenAttention/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/NVIDIANemotronNano2-AnAccurateandEfficientHybridMamba-Transf/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/NativeSparseAttention-Hardware-AlignedandNativelyTrainableSp/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/OLMoE-OpenMixture-of-ExpertsLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/OUTRAGEOUSLYLARGENEURALNETWORKS-THESPARSELY-GATEDMIXTURE-OF-/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/PaddleOCR-VL-BoostingMultilingualDocumentParsingviaa0.9BUltr/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/PyTorch-AnImperativeStyle%2CHigh-PerformanceDeepLearningLibrar/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/QWEN2TECHNICALREPORT/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/Qwen3TechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/RPG-AREPOSITORYPLANNINGGRAPHFORUNIFIEDANDSCALABLECODEBASEGEN/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/RecurrentGemma-MovingPastTransformersforEfficientOpenLanguag/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/ReplacingsoftmaxwithReLUinVisionTransformers/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/SCALINGLAWSMEETMODELARCHITECTURE-TOWARDINFERENCE-EFFICIENTLL/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/ScalableDiffusionModelswithTransformers/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/ScalingLatentReasoningviaLoopedLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/SequencetoSequenceLearningwithNeuralNetworks/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/TheLlama3HerdofModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/TrainingLargeLanguageModelstoReasoninaContinuousLatentSpace/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/TransformersareSSMs-GeneralizedModelsandEfficientAlgorithmsT/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/TransformerswithoutNormalization/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/VIBEVOICETechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/XGBoost-AScalableTreeBoostingSystem/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/YouOnlyLookOnce-Unified%2CReal-TimeObjectDetection/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/Zero-ShotText-to-ImageGeneration/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/architecture/doi-10.1038-nature14539-Deep-learning/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/code/2511.00839-CodeClash-BenchmarkingGoal-OrientedSoftwareEngineering/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/context-optimization/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/context-optimization/2408.01367-Transformers-are-Universal-In-context-Learners/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/context-optimization/2501.08313-MiniMax-01-ScalingFoundationModelswithLightningAttention/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/context-optimization/2510.27258-Higher-orderLinearAttention/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/context-optimization/AComprehensiveSurveyonLongContextLanguageModeling/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/context-optimization/AControlledStudyonLongContextExtensionandGeneralizationinLLM/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/context-optimization/ASurveyofContextEngineeringforLargeLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/context-optimization/Deepcontextualizedwordrepresentations/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/context-optimization/LLM2Vec-LargeLanguageModelsAreSecretlyPowerfulTextEncoders/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/context-optimization/LLaVA-CoT-LetVisionLanguageModelsReasonStep-by-Step/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/context-optimization/LongCodeZip-CompressLongContextforCodeLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/context-optimization/LongRoPE-ExtendingLLMContextWindowBeyond2MillionTokens/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/context-optimization/YaRN-EfficientContextWindowExtensionofLargeLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/data/2501.08365-TowardsBestPracticesforOpenDatasetsforLLMTraining/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/diffusion/2512.15745-LLaDA2.0-Scaling-Up-Diffusion-Language-Models-to-100B/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/diffusion/2604.06916-FP4-Explore-BF16-Train-Diffusion-Reinforcement-Learning-via-Efficient-Rollout-Sc/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/distributed-training/2501.18512-StreamingDiLoCowithoverlappingcommunication-TowardsaDistribu/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/2102.12452-Probing-Classifiers-Promises-Shortcomings-and-Advances/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/2402.07841-DoMembershipInferenceAttacksWorkonLargeLanguageModels%3F/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/2406.08446-OLMES-AStandardforLanguageModelEvaluations/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/2411.05403-BenchmarkingDistributionalAlignmentofLargeLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/2501.14249-Humanity%E2%80%99sLastExam/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/2501.15654-PeoplewhofrequentlyuseChatGPTforwritingtasksareaccurateandr/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/2512.14691-MMGR-Multi-Modal-Generative-Reasoning/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/ASurveyonEvaluationofLargeLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/AttentionHeadsofLargeLanguageModels-ASurvey/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/CanLLMsGenerateNovelResearchIdeas%3FALarge-ScaleHumanStudywith/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/CarlemanEstimatesandControllabilityofForwardStochasticParabo/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/ChatbotArena-AnOpenPlatformforEvaluatingLLMsbyHumanPreferenc/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/DeepSeek-R1Thoughtology-Let%E2%80%99sthinkaboutLLMreasoning/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/DocumentParsingUnveiled-Techniques%2CChallenges%2CandProspectsfo/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/EFFICIENTLLM-EFFICIENCYINLARGELANGUAGEMODELSEVALUATIONONARCH/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/EvaluatingLargeLanguageModelsTrainedonCode/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/GENERALIZATIONV.S.MEMORIZATION-TRACINGLANGUAGEMODELS%E2%80%99CAPABIL/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/GPQA-AGraduate-LevelGoogle-ProofQ%26ABenchmark/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/HOLISTICALLYEVALUATINGTHEENVIRONMENTALIMPACTOFCREATINGLANGUA/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/HolisticEvaluationofLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/HowIsChatGPT%E2%80%99sBehaviorChangingoverTime%3F/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/ImageNetLargeScaleVisualRecognitionChallenge/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/IsChain-of-ThoughtReasoningofLLMsaMirage%3FADataDistributionLe/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/JudgingLLM-as-a-JudgewithMT-BenchandChatbotArena/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/KernelBench-CanLLMsWriteEfficientGPUKernels%3F/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/LMSYS-CHAT-1M-ALARGE-SCALEREAL-WORLDLLMCONVERSATIONDATASET/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/M3DSYNTH-ADATASETOFMEDICAL3DIMAGESWITHAI-GENERATEDLOCALMANIP/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/MMLU-Pro-AMoreRobustandChallengingMulti-TaskLanguageUndersta/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/MMMU-Pro-AMoreRobustMulti-disciplineMultimodalUnderstandingB/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/MicrosoftCOCO-CommonObjectsinContext/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/NotAllLLMReasonersAreCreatedEqual/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/PaperBench-EvaluatingAI%E2%80%99sAbilitytoReplicateAIResearch/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/PremiseOrderMattersinReasoningwithLargeLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/REASONINGGYM-ReasoningEnvironmentsforReinforcementLearningwi/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/RULER-What%E2%80%99stheRealContextSizeofYourLong-ContextLanguageMode/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/RewardBench-EvaluatingRewardModelsforLanguageModeling/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/SUMOFTHEGL%283%29FOURIERCOEFFICIENTSOVERQUADRATICSANDMIXEDPOWERS/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/SWE-Perf-CanLanguageModelsOptimizeCodePerformanceonReal-Worl/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/TheAutomatedLLMSpeedrunningBenchmark-ReproducingNanoGPTImpro/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/TheLeaderboardIllusion/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/UndesirableMemorizationinLargeLanguageModels-ASurvey/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/VibeChecker-AligningCodeEvaluationwithHumanPreference/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/WhyDoMulti-AgentLLMSystemsFail%3F/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/WhyLanguageModelsHallucinate/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/evaluation/alphafold2-2021-Unable-to-Extract---No-Paper-Content-Provided/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/1911.00172-GENERALIZATIONTHROUGHMEMORIZATION-NEARESTNEIGHBORLANGUAGEMOD/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/2512.12087-BLASST-Dynamic-BLocked-Attention-Sparsity-via-Softmax-Thresholding/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/2512.23675-End-to-End-Test-Time-Training-for-Long-Context/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/2603.05451-FlashAttention-4-Algorithm-and-Kernel-Pipelining-Co-Design-for-Asymmetric-Hardwa/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/2603.12201-IndexCache-Accelerating-Sparse-Attention-via-Cross-Layer-Index-Reuse/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/2603.28342-Kernel-Smith-A-Unified-Recipe-for-Evolutionary-Kernel-Optimization/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/2604.04921-TriAttention-Efficient-Long-Reasoning-with-Trigonometric-KV-Compression/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/2604.08302-DMax-Aggressive-Parallel-Decoding-for-dLLMs/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/ASurveyofSmallLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/AUXILIARY-LOSS-FREELOADBALANCINGSTRATEGYFORMIXTURE-OF-EXPERT/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/AcceleratingLLMInferencewithStagedSpeculativeDecoding/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/Astra-AMulti-AgentSystemforGPUKernelPerformanceOptimization/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/BreaktheSequentialDependencyofLLMInferenceUsingLOOKAHEADDECO/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/Can1BLLMSurpass405BLLM%3FRethinkingCompute-OptimalTest-TimeSca/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/Chain-of-ThoughtReasoningwithoutPrompting/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/ChunkAttention-EfficientSelf-AttentionwithPrefix-AwareKVCach/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/CompactLanguageModelsviaPruningandKnowledgeDistillation/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/ContextParallelismforScalableMillion-TokenInference/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/DoesMoreInference-TimeComputeReallyHelpRobustness%3F/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/EAGLE-SpeculativeSamplingRequiresRethinkingFeatureUncertaint/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/EFFICIENTLYSCALINGTRANSFORMERINFERENCE/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/EFFICIENTSTREAMINGLANGUAGEMODELSWITHATTENTIONSINKS/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/EfficientlyScalingLLMReasoningwithCertaindex/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/FAST-DLLMV2-EfficientBlock-DiffusionLLM/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/FLEXATTENTION-APROGRAMMINGMODELFORGENERATINGOPTIMIZEDATTENTI/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/Fast-dLLM-Training-freeAccelerationofDiffusionLLMbyEnablingK/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/FlashAttention-2-FasterAttentionwithBetterParallelismandWork/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/FlashAttention-3-FastandAccurateAttentionwithAsynchronyandLo/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/FlashAttention-FastandMemory-EfficientExactAttentionwithIO-A/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/FlashFFTConv-EfficientConvolutionsforLongSequenceswithTensor/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/FlashInfer-EfficientandCustomizableAttentionEngineforLLMInfe/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/FlexGen-High-ThroughputGenerativeInferenceofLargeLanguageMod/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/GQA-TrainingGeneralizedMulti-QueryTransformerModelsfromMulti/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/Inference-TimeScalingforGeneralistRewardModeling/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/InferenceScalingforLong-ContextRetrievalAugmentedGeneration/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/InfiniteHiP-ExtendingLanguageModelContextUpto3MillionTokenso/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/LLMinaflash-EfficientLargeLanguageModelInferencewithLimitedM/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/LargeLanguageMonkeys-ScalingInferenceComputewithRepeatedSamp/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/LearningAdaptiveParallelReasoningwithLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/Locality-awareParallelDecodingforEfficientAutoregressiveImag/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/MEDUSA-SimpleLLMInferenceAccelerationFrameworkwithMultipleDe/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/MUTUALREASONINGMAKESSMALLERLLMSSTRONGERPROBLEM-SOLVERS/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/MiniMax-M1-ScalingTest-TimeComputeEfficientlywithLightningAt/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/ParallelScalingLawforLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/RECURRENTDRAFTERFORFASTSPECULATIVEDECODINGINLARGELANGUAGEMOD/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/SLA-BEYONDSPARSITYINDIFFUSIONTRANSFORMERSVIAFINE-TUNABLESPAR/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/SageAttention2-EfficientAttentionwithThoroughOutlierSmoothin/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/ScalingLLMTest-TimeComputeOptimallycanbeMoreEffectivethanSca/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/Sleep-timeCompute-BeyondInferenceScalingatTest-time/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/SpecInfer-AcceleratingLargeLanguageModelServingwithTree-base/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/Squid-LongContextasaNewModalityforEnergy-EfficientOn-DeviceL/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/StayontopicwithClassifier-FreeGuidance/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/THEPITFALLSOFKVCACHECOMPRESSION/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/TVM-AnAutomatedEnd-to-EndOptimizingCompilerforDeepLearning/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/TheEndofManualDecoding-TowardsTrulyEnd-to-EndLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/TheMambaintheLlama-DistillingandAcceleratingHybridModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/ThunderKittens-Simple%2CFast%2CandAdorableAIKernels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/TransMLA-MLAIsAllYouNeed/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/TuningLanguageModelsbyProxy/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/inference-optimization/s1-Simpletest-timescaling/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/2204.06745-GPT-NeoX-20B-AnOpen-SourceAutoregressiveLanguageModel/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/2310.16789-DETECTINGPRETRAININGDATAFROMLARGELANGUAGEMODELS/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/2407.04620-Learningto%28LearnatTestTime%29-RNNswithExpressiveHiddenStates/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/2408.11796-LLMPruningandDistillationinPractice-TheMinitronApproach/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/2501.09891-EvolvingDeeperLLMThinking/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/2507.06261-Gemini2.5-PushingtheFrontierwithAdvancedReasoning%2CMultimodal/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/2510.27656-RDMAPOINT-TO-POINTCOMMUNICATIONFORLLMSYSTEMS/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/2512.02038-DeepResearch-ASystematicSurvey/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/2512.02556-DeepSeek-V3.2-PushingtheFrontierofOpenLargeLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/2512.11251-Insight-Miner-A-Time-Series-Analysis-Dataset-for-Cross-Domain-Alignment-with-Nat/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/2512.13961-Olmo-3/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/2512.15586-Bolmo-Byteifying-the-Next-Generation-of-Language-Models/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/2512.16093-TurboDiffusion-Accelerating-Video-Diffusion-Models-by-100-200-Times/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/2512.23165-Evaluating-Parameter-Efficient-Methods-for-RLVR/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/2601.01739-K-EXAONE-Technical-ReportJourney-to-Frontier-Level-Performance-of-Foundation-Mod/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/2601.09012-TranslateGemma-Technical-Report/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/ASURVEYOFSELF-EVOLVINGAGENTS-ONPATHTOARTIFICIALSUPERINTELLIG/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/Alpa-AutomatingInter-andIntra-OperatorParallelismforDistribu/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/Caffe-ConvolutionalArchitectureforFastFeatureEmbedding/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/ChatGLM-AFamilyofLargeLanguageModelsfromGLM-130BtoGLM-4AllTo/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/DIVERSITYEMPOWERSINTELLIGENCE-INTEGRATINGEXPERTISEOFSOFTWARE/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/DeepMMSearch-R1-EmpoweringMultimodalLLMsinMultimodalWebSearc/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/Gold-medalistPerformanceinSolvingOlympiadGeometrywithAlphaGe/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/InsightsintoDeepSeek-V3-ScalingChallengesandReflectionsonHar/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/LargeLanguageModelAgent-ASurveyonMethodology%2CApplicationsand/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/MXNet-AFlexibleandEfficientMachineLearningLibraryforHeteroge/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/Memento-Fine-tuningLLMAgentswithoutFine-tuningLLMs/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/Mixture-of-AgentsEnhancesLargeLanguageModelCapabilities/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/Mooncake-AKVCache-centricDisaggregatedArchitectureforLLMServ/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/OLMOTRACE-TracingLanguageModelOutputsBacktoTrillionsofTraini/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/OLMo-AcceleratingtheScienceofLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/PyTorchFSDP-ExperiencesonScalingFullyShardedDataParallel/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/Ray-ADistributedFrameworkforEmergingAIApplications/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/Relax-ComposableAbstractionsforEnd-to-EndDynamicMachineLearn/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/SGLang-EfficientExecutionofStructuredLanguageModelPrograms/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/TensorFlow-Asystemforlarge-scalemachinelearning/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/TensorFlow-Large-ScaleMachineLearningonHeterogeneousDistribu/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/TowardsGeneralAgenticIntelligenceviaEnvironmentScaling/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/llm-systems/UniversalDeepResearch-BringYourOwnModelandStrategy/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/low-precision/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/low-precision/2208.07339-LLM.int8%28%29-8-bitMatrixMultiplicationforTransformersatScale/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/low-precision/2510.25602-INTv.s.FP-AComprehensiveStudyofFine-GrainedLow-bitQuantizati/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/low-precision/2511.02302-FP8-Flow-MoE-A-Casting-Free-FP8-Recipe-without-Double-Quantization-Error/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/low-precision/2512.10938-StrongerNormalization-FreeTransformers/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/low-precision/AWQ-Activation-AwareWeightQuantizationforOn-DeviceLLMCompres/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/low-precision/FP8-LM-TrainingFP8LargeLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/low-precision/PretrainingLargeLanguageModelswithNVFP4/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/low-precision/SageAttention3-MicroscalingFP4AttentionforInferenceandAnExpl/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/low-precision/TheEraof1-bitLLMs-AllLargeLanguageModelsarein1.58Bits/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/low-precision/%E2%80%9CGiveMeBF16orGiveMeDeath%E2%80%9D%3FAccuracy-PerformanceTrade-OffsinLL/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/moe/2512.14080-SonicMoE-Accelerating-MoE-with-IO-and-Tile-aware-Optimizations/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/moe/2512.23447-Coupling-Experts-and-Routers-in-Mixture-of-Experts-via-an-Auxiliary-Loss/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/moe/2601.05296-MoEBlaze-Breaking-the-Memory-Wall-for-Efficient-MoE-Training-on-Modern-GPUs/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/1710.10903-GRAPH-ATTENTION-NETWORKS/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/2507.13264-Voxtral/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/2510.23095-REVISITINGMULTIMODALPOSITIONALENCODINGINVISION-LANGUAGEMODE/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/2511.10647-DepthAnything3-RecoveringtheVisualSpacefromAnyViews/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/2511.21631-Qwen3-VLTechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/2601.04720-Qwen3-VL-Embedding-and-Qwen3-VL-Reranker-A-Unified-Framework-for-State-of-the-Ar/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/2601.10611-Molmo2Open-Weights-and-Data-for-Vision-Language-Modelswith-Video-Understanding-a/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/2603.25040-Intern-S1-Pro-Scientific-Multimodal-Foundation-Model-at-Trillion-Scale/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/ATOKEN-AUNIFIEDTOKENIZERFORVISION/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/AnEmpiricalStudyofScalingInstruction-TunedLargeMultimodalMod/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/AnIntroductiontoVision-LanguageModeling/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/Apollo-AnExplorationofVideoUnderstandinginLargeMultimodalMod/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/Buildingandbetterunderstandingvision-languagemodels-insights/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/Cube-ARobloxViewof3DIntelligence/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/Emu3-Next-TokenPredictionisAllYouNeed/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/GLM-4.5VandGLM-4.1V-Thinking-TowardsVersatileMultimodalReaso/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/Gemini-AFamilyofHighlyCapableMultimodalModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/INTERN-S1-ASCIENTIFICMULTIMODALFOUNDATIONMODEL/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/InternVL3-ExploringAdvancedTrainingandTest-TimeRecipesforOpe/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/InternVL3.5-AdvancingOpen-SourceMultimodalModelsinVersatilit/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/KIMI-VLTECHNICALREPORT/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/LLaVA-OneVision-1.5-FullyOpenFrameworkforDemocratizedMultimo/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/MMaDA-MultimodalLargeDiffusionLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/MolmoandPixMo-OpenWeightsandOpenDataforState-of-the-ArtVisio/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/NVILA-EfficientFrontierVisualLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/NVLM-OpenFrontier-ClassMultimodalLLMs/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/NextTokenPredictionTowardsMultimodalIntelligence-AComprehens/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/Phi-4-MiniTechnicalReport-CompactyetPowerfulMultimodalLangua/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/Qwen-ImageTechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/Qwen2-VL-EnhancingVision-LanguageModel%E2%80%99sPerceptionoftheWorld/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/Qwen2.5-OmniTechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/Qwen2.5-VLTechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/Qwen3-OmniTechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/SAIL-VL2TechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/ScalingAutoregressiveMulti-ModalModels-PretrainingandInstruc/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/SegmentAnything/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/SmolVLM-Redefiningsmallandefficientmultimodalmodels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/multimodal/Videomodelsarezero-shotlearnersandreasoners/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/2201.02177-GROKKING-GENERALIZATION-BEYOND-OVERFITTING-ON-SMALL-ALGORITHMIC-DATASETS/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/2209.11895-In-context-Learning-and-Induction-Heads/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/2306.13891-EstimatingtheCausalEffectofEarlyArXivingonPaperAcceptance/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/2409.14254-Instruction-Following-Without-Instruction-Tuning/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/2501.00958-2.5YearsinClass-AMultimodalTextbookforVision-LanguagePretrai/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/2501.05453-AnEmpiricalStudyofAutoregressivePre-trainingfromVideos/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/2501.15383-Qwen2.5-1MTechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/2510.22115-EveryActivationBoosted-ScalingGeneralReasonerto1TrillionOpen/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/2512.13687-Towards-Scalable-Pre-training-of-Visual-Tokenizers-for-Generation/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/2512.22955-Diversity-or-Precision-A-Deep-Dive-into-Next-Token-Prediction/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/2603.27164-daVinci-LLM-Towards-the-Science-of-Pretraining/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/2OLMo2Furious/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/BART-DenoisingSequence-to-SequencePre-trainingforNaturalLang/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/BERT-Pre-trainingofDeepBidirectionalTransformersforLanguageU/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/BLOOM-A176B-ParameterOpen-AccessMultilingualLanguageModel/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/CLIMB-CLustering-basedIterativeDataMixtureBootstrappingforLa/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/DEMOCRATIZINGOPENANDCOMPLIANTLLMSFORGLOBALLANGUAGEENVIRONMEN/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/DINOv3/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/DataComp-LM-Insearchofthenextgenerationoftrainingsetsforlang/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/DataDecide-HowtoPredictBestPretrainingDatawithSmallExperimen/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/DeepSeek-Coder-V2-BreakingtheBarrierofClosed-SourceModelsinC/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/DeepSeek-Coder-WhentheLargeLanguageModelMeetsProgramming-The/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/DeepSeekLLM-ScalingOpen-SourceLanguageModelswithLongtermism/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/DeepSeekMath-PushingtheLimitsofMathematicalReasoninginOpenLa/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/DistributedRepresentationsofWordsandPhrasesandtheirCompositi/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/Dolma-anOpenCorpusofThreeTrillionTokensforLanguageModelPretr/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/EfficientEstimationofWordRepresentationsinVectorSpace/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/EmbeddingGemma-PowerfulandLightweightTextRepresentations/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/FantasticPretrainingOptimizersandWheretoFindThem/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/GeminiEmbedding-GeneralizableEmbeddingsfromGemini/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/Gemma2-ImprovingOpenLanguageModelsataPracticalSize/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/IN-CONTEXTPRETRAINING-LANGUAGEMODELINGBEYONDDOCUMENTBOUNDARI/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/InstructionPre-Training-LanguageModelsareSupervisedMultitask/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/LargeLanguageModelsforCompilerOptimization/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/MetaCLIP2-AWorldwideScalingRecipe/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/ModelMerginginPre-trainingofLargeLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/MultilingualE5TextEmbeddings-ATechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/Nemotron-415BTechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/ONLINEARREPRESENTATIONSANDPRETRAININGDATAFREQUENCYINLANGUAGE/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/OPENDATASYNTHESISFORDEEPRESEARCH/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/OPT-OpenPre-trainedTransformerLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/OpenCoder-TheOpenCookbookforTop-TierCodeLargeLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/PaLM2TechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/Phi-3TechnicalReport-AHighlyCapableLanguageModelLocallyonYou/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/Phi-4TechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/Pre-trainingunderinfinitecompute/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/Qwen2.5-CoderTechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/Qwen2.5TechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/Qwen3Embedding-AdvancingTextEmbeddingandRerankingThroughFoun/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/RedPajama-anOpenDatasetforTrainingLargeLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/RephrasingtheWeb-ARecipeforCompute%26Data-EfficientLanguageMod/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/RoBERTa-ARobustlyOptimizedBERTPretrainingApproach/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/ScalingAgentsviaContinualPre-training/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/ScalingLawsforNeuralLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/ScalingPre-trainingtoOneHundredBillionDataforVisionLanguageM/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/ScalingSyntheticDataCreationwith1%2C000%2C000%2C000Personas/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/SigLIP2-MultilingualVision-LanguageEncoderswithImprovedSeman/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/SmolLM2-WhenSmolGoesBig%E2%80%94Data-CentricTrainingofaSmallLanguage/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/Source2Synth-SyntheticDataGenerationandCurationGroundedinRea/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/THINKINGAUGMENTEDPRE-TRAINING/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/TextbooksAreAllYouNeed/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/TextbooksAreAllYouNeedII-phi-1.5technicalreport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/TheCommonPilev0.1-An8TBDatasetofPublicDomainandOpenlyLicense/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/TheFineWebDatasets-DecantingtheWebfortheFinestTextDataatScal/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/ToCode%2CorNotToCode%3FExploringImpactofCodeinPre-training/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/TrainingCompute-OptimalLargeLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/UnleashingthePowerofDataTsunami-AComprehensiveSurveyonDataAs/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/pretraining/olmOCR-UnlockingTrillionsofTokensinPDFswithVisionLanguageMod/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/prompting/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/prompting/Chain-of-ThoughtPromptingElicitsReasoninginLargeLanguageMode/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/prompting/LanguageModelsareFew-ShotLearners/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/prompting/LargeLanguageModelsareZero-ShotReasoners/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/prompting/LargeLanguageModelsasOptimizers/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/prompting/Meta-Prompting-EnhancingLanguageModelswithTask-AgnosticScaff/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/prompting/RethinkingtheRoleofDemonstrations-WhatMakesIn-ContextLearnin/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/prompting/ThePromptReport-ASystematicSurveyofPromptEngineeringTechniqu/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/reasoning/2511.22570-DeepSeekMath-V2-TowardsSelf-VerifiableMathematicalReasoning/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/reasoning/2512.10739-Long-horizon-Reasoning-Agent-for-Olympiad-Level-Mathematical-Problem-Solving/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/reasoning/2512.16969-Probing-Scientific-General-Intelligence-of-LLMswith-Scientist-Aligned-Workflows/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/reasoning/2512.17901-WHEN-REASONING-MEETS-ITS-LAWS/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/reasoning/2512.23988-Fantastic-Reasoning-Behaviors-and-Where-to-Find-Them-Unsupervised-Discovery-of-t/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/retrieval/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/retrieval/2603.10913-LLM2VEC-GEN-Generative-Embeddings-from-Large-Language-Models/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/retrieval/Infini-gram-ScalingUnboundedn-gramLanguageModelstoaTrillionT/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/retrieval/OntheTheoreticalLimitationsofEmbedding-BasedRetrieval/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/retrieval/ScalingRetrieval-BasedLanguageModelswithaTrillion-TokenDatas/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/2507.14843-THEINVISIBLELEASH%3FWHYRLVRMAYORMAYNOTESCAPEITSORIGIN/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/2512.01374-StabilizingReinforcementLearningwithLLMs-FormulationandPract/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/2512.16649-JustRL-Scaling-a-1.5B-LLM-with-a-Simple-RL-Recipe/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/2601.05242-GDPO-Group-reward-Decoupled-Normalization-Policy-Optimization-for-Multi-reward-R/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/2601.08521-Your-Group-Relative-Advantage-Is-Biased/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/2603.08068-In-Context-Reinforcement-Learning-for-Tool-Use-in-Large-Language-Models/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/2603.08660-How-Far-Can-Unsupervised-RLVR-Scale-LLM-Training/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/2603.14473-AI-Can-Learn-Scientific-Taste/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/2603.18815-ProRL-Agent-Rollout-as-a-Service-for-RL-Training-of-Multi-Turn-LLM-Agents/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/2603.21383-PivotRL-High-Accuracy-Agentic-Post-Training-at-Low-Compute-Cost/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/AGENTICREINFORCEDPOLICYOPTIMIZATION/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/ASurveyofReinforcementLearningforLargeReasoningModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/CUDA-L1-ImprovingCUDAOptimizationviaContrastiveReinforcement/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/CompetitiveProgrammingwithLargeReasoningModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/DAPO-AnOpen-SourceLLMReinforcementLearningSystematScale/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/DEEPSEARCH-OVERCOMETHEBOTTLENECKOFREINFORCEMENTLEARNINGWITHV/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/DoesReinforcementLearningReallyIncentivizeReasoningCapacityi/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/GLM-4.5-Agentic%2CReasoning%2CandCoding%28ARC%29FoundationModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/GroupSequencePolicyOptimization/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/Magistral/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/NEMOTRON-CROSSTHINK-ScalingSelf-LearningbeyondMathReasoning/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/Phi-4-reasoningTechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/ProximalPolicyOptimizationAlgorithms/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/QERL-BEYONDEFFICIENCY%E2%80%93QUANTIZATIONENHANCEDREINFORCEMENTLEARN/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/ReinforcementLearningforReasoninginLargeLanguageModelswithOn/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/ScalingRLtoLongVideos/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/Search-R1-TrainingLLMstoReasonandLeverageSearchEngineswithRe/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/SecretsofRLHFinLargeLanguageModelsPartI-PPO/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/SharingisCaring-EfficientLMPost-TrainingwithCollectiveRLExpe/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/TheArtofScalingReinforcementLearningComputeforLLMs/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/TheLandscapeofAgenticReinforcementLearningforLLMs-ASurvey/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/TrainingLanguageModelstoSelf-CorrectviaReinforcementLearning/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/TrustRegionPolicyOptimization/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/Webscale-RL-AutomatedDataPipelineforScalingRLDatatoPretraini/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/rl-training/ZEROSEARCH-IncentivizetheSearchCapabilityofLLMswithoutSearch/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/safety/2507.05578-TheLandscapeofMemorizationinLLMs-Mechanisms%2CMeasurement%2CandM/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/serving/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/serving/DistServe-DisaggregatingPrefillandDecodingforGoodput-optimiz/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/serving/EfficientMemoryManagementforLargeLanguageModelServingwithPag/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/serving/MARCONI-PREFIXCACHINGFORTHEERAOFHYBRIDLLMS/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/serving/MegaScale-Infer-ServingMixture-of-ExpertsatScalewithDisaggre/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/serving/NanoFlow-TowardsOptimalLargeLanguageModelServingThroughput/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/serving/PUNICA-MULTI-TENANTLORASERVING/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/serving/PrefillOnly-AnInferenceEngineforPrefill-onlyWorkloadsinLarge/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/serving/S-LoRA-ServingThousandsofConcurrentLoRAAdapters/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/serving/TamingThroughput-LatencyTradeoffinLLMInferencewithSarathi-Se/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/2501.17161-SFTMemorizes%2CRLGeneralizes-AComparativeStudyofFoundationMode/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/2511.10643-Black-BoxOn-PolicyDistillationofLargeLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/2601.00417-Deep-Delta-Learning/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/2603.13985-Supervised-Fine-Tuning-versus-Reinforcement-Learning-A-Study-of-Post-Training-Me/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/2604.00626-A-Survey-of-On-Policy-Distillation-for-Large-Language-Models/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/ADAM-AMETHODFORSTOCHASTICOPTIMIZATION/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/AgentLearningviaEarlyExperience/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/Better%26FasterLargeLanguageModelsviaMulti-tokenPrediction/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/BeyondDataandModelParallelismforDeepNeuralNetworks/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/BeyondHumanData-ScalingSelf-TrainingforProblem-SolvingwithLa/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/BeyondReasoningGains-MitigatingGeneralCapabilitiesForgetting/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/CriticalBatchSizeRevisited-ASimpleEmpiricalApproachtoLarge-B/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/DCP-AddressingInputDynamismInLong-ContextTrainingviaDynamicC/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/EXAONE3.5-SeriesofLargeLanguageModelsforReal-worldUseCases/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/Eagle2.5-BoostingLong-ContextPost-TrainingforFrontierVision-/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/EfficientLarge-ScaleLanguageModelTrainingonGPUClustersUsingM/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/EfficientLong-contextLanguageModelTrainingbyCoreAttentionDis/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/Hermes4TechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/LIMI-LessisMoreforAgency/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/LIMO-LessisMoreforReasoning/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/LigerKernel-EfficientTritonKernelsforLLMTraining/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/LoRA-Low-RankAdaptationofLargeLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/MM1.5-Methods%2CAnalysis%26InsightsfromMultimodalLLMFine-tuning/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/MergetoLearn-EfficientlyAddingSkillstoLanguageModelswithMode/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/MuonOutperformsAdaminTail-EndAssociativeMemoryLearning/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/ON-POLICYDISTILLATIONOFLANGUAGEMODELS-LEARNINGFROMSELF-GENER/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/ONTHEGENERALIZATIONOFSFT-AREINFORCEMENTLEARNINGPERSPECTIVEWI/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/Parameter-EfficientTransferLearningforNLP/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/PipeDream-FastandEfficientPipelineParallelDNNTraining/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/Pre-trainingDistillationforLargeLanguageModels-ADesignSpaceE/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/PyTorchFSDP-ExperiencesonScalingFullyShardedDataParallel/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/RAFT-AdaptingLanguageModeltoDomainSpecificRAG/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/ROBUSTFT-RobustSupervisedFine-tuningforLargeLanguageModelsun/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/ReFT-RepresentationFinetuningforLanguageModels/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/ReinforcementPre-Training/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/SelfForcing-BridgingtheTrain-TestGapinAutoregressiveVideoDif/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/TongyiDeepResearchTechnicalReport/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/ZeRO-MemoryOptimizationsTowardTrainingTrillionParameterModel/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/doi-10.48550-arxiv.1406.2661-Generative-Adversarial-Nets/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/doi-10.5555-1953048.1953056-Random-Search-for-Hyper-Parameter-Optimization/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/training-methods/doi-10.5555-3045118.3045280-Dropout-A-Simple-Way-to-Prevent-Neural-Networks-from-Overfitting/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/uncategorized/1409.4842-Goingdeeperwithconvolutions/ 2026-06-01 https://summary-of-some-paper-in-cuda.readthedocs.io/vision/2302.00294-The-geometry-of-hidden-representations-of-large-transformer-models/ 2026-06-01