Topic

Machinelearning

50 items across the graph · 47 news stories — tagged with Machinelearning.

Latest news

NewsReddit r/MachineLearningLive · just now

How papers are selected for Best Paper, Oral, or Highlight presentation at major ML/CV conferences such as CVPR, ICCV, ECCV, NeurIPS, and ICLR? [D]

From what I understand, reviewers usually do not directly vote for these categories or nominate papers themselves. So how does the selection process typically work? Here are specific questions I wonder - Who actually selects the candidates: ACs, SACs, program chairs, award committees, or a separate committee? - Do ACs…

Read full story →

More news · 46

NewsReddit r/MachineLearningLive · just now

Tom Yeh's AI by hand? is it worth it? [D]

Thinking about getting two months at his website and getting a stronger understanding of machine learning since I am building tools with ai models from hugging face. Have anyone tried it? submitted by /u/TheSmashingChamp [link]

Read full story →
NewsReddit r/MachineLearningLive · just now

H64LM: A 249M-parameter Mixture-of-Experts Transformer built from scratch in PyTorch [P]

Hi everyone, I built H64LM, a research project to better understand modern LLMs by implementing one from scratch in PyTorch. Instead of relying on high-level training frameworks, I implemented the core components myself attention, MoE routing, normalization, and the training loop…

Read full story →
NewsReddit r/MachineLearningLive · just now

[D] Self-Promotion Thread

Please post your personal projects, startups, product placements, collaboration needs, blogs etc. Please mention the payment and pricing requirements for products and services. Please do not post link shorteners, link aggregator websites , or auto-subscribe links. -- Any abuse of…

Read full story →
NewsReddit r/MachineLearningLive · 1h ago

Hamiltonian Neural Networks from a Differential Geometry Perspective [D]

Read full story →
NewsReddit r/MachineLearningLive · 2h ago

Improving machine-translated novels via style transfer — looking for advice on the faithfulness/fluency tradeoff [P]

Hey all. I recently started working on a project to improve machine-translated webnovels via style transfer. The basic idea is to take the clunky translated prose and rewrite it to something that reads like it was written by a professional author, while remaining as faithful as p…

Read full story →
NewsReddit r/MachineLearningLive · 8h ago

17KB, zero GPU, zero LLM: a Markov-chain experiment that outperforms baselines on Collatz and prime gaps. [D]

How far can a single Markov chain primitive go if you apply it at every level — byte, word, token, action, decision — with the same method call? The result is MCR, a ~2000-line experiment: MCR("byte").aprender(a, b) MCR("palavra").aprender(a, b) MCR("decisao").aprender(a, b) Same…

Read full story →
NewsReddit r/MachineLearningLive · 19h ago

First Time Submitter to ArXiv and Need an Endorser [D]

Hi a I'm submitting my first arXiv paper in cs.AI. arXiv requires first-time submitter(s) to have an endorser. Is anyone here active in cs.ai or similar such as emerging tech, agents, etc. and willing to take a 30-second look at the abstract and click endorse if it's a fit? I can…

Read full story →
NewsReddit r/MachineLearningLive · 21h ago

SentryCode: Real-time Auditor + Honeytokens for AI Coding Agents [P]

In light of recent privacy concerns arising from local AI coding agents performing telemetry, environmental scanning, and hidden cue fingerprinting, I've open-sourced SentryCode—a kernel-level behavior auditing tool. It logs file/network/cue activity, uses honeypot tokens for zer…

Read full story →
NewsReddit r/MachineLearningLive · yesterday

IN 2026 ML BOOK OUTDATED? [D]

Hello everyone. I wanted to ask if the below book is a good source of machine learning in 2026 or if it is now outdated and if there are better options. Hands-on Machine Learning with. Scikit-Learn, Keras, and TensorFlow : The Third Edition (1200 pages). submitted by /u/Elyartake…

Read full story →
NewsReddit r/MachineLearningLive · 2d ago

How to describe a model that has higher accuracy with fewer #param and FLOPs? [D]

Hello, My supervisor is nowhere to be found so I am turning to the internet for my naive questions. submitted by /u/obliviousphoenix2003 [link]

Read full story →
NewsReddit r/MachineLearningLive · 2d ago

[D] Looking for Machine Learning / Deep Learning Final Year Project Ideas[D]

Hi everyone, I'm a final-year Computer Engineering student looking for ideas for my major engineering project. I'm interested in building something that uses Machine Learning or Deep Learning and is technically strong enough for a final-year project. I'd like it to solve a real-w…

Read full story →
NewsReddit r/MachineLearningLive · 2d ago

[D] Simple Questions Thread

Please post your questions here instead of creating a new thread. Encourage others who create new posts for questions to post here instead! Thread will stay alive until next one so keep posting after the date in the title. Thanks to everyone for answering questions in the previou…

Read full story →
NewsReddit r/MachineLearningLive · 2d ago

80TB+ of astronomy for the HDD-poor: crossmatch the Universe from your laptop [R]

Today is the day you (🫵!) get access to 80TB plus of data from over 30 astronomical surveys in one place. 4GB of RAM is enough even at Gaia Scale. Check out our writeup here: https://huggingface.co/blog/hugging-science/multimodal-universe-hats And a tutorial here https://asciine…

Read full story →
NewsReddit r/MachineLearningLive · 3d ago

Are all LLM research papers nowadays 100+ pages beasts?[D]

Was reading some research papers put out by Anthropic (and some other organizations/researchers) and one thing I've noticed is that these research papers consistently all share the same quality: Oftentimes over 100 pages of pure words, interspersed with screenshots of very dense/…

Read full story →
NewsReddit r/MachineLearningLive · 3d ago

I built an open-source memory governance layer for AI assistants - looking for technical feedback [P]

I’ve been working on a project called MemoryOps AI. The problem I’m trying to solve is context debt in AI agents. Most memory demos look like this: chat message → vector database → retrieve later That works for demos, but I think production agents need more than retrieval. They n…

Read full story →
NewsReddit r/MachineLearningLive · 4d ago

Rejected MICCAI paper: workshop -> journal/conference or directly journal/conference [R]

Premise: this work is my first year PhD, and I dropped out for personal reasons. I still want to do research but independently. I have tried to submit my explainability paper to MICCAI. Sadly, for doubtful/good reasons, it got rejected. Among the reviewers, one explicitly suggest…

Read full story →
NewsReddit r/MachineLearningLive · 4d ago

I built a demo agricultural planning system with an AI advisor for small-scale farmers in Nicaragua using NASA data [p]

(this was deleted before but i dont know if it was the filters of reddit or the moderators, if is the moderators i will not post it again after you delete it sorry.) (The name will probably change soon because I didn't realize "AgroVision" is already a registered trademark lol.)…

Read full story →
NewsReddit r/MachineLearningLive · 4d ago

Adaptive Mixture of Experts Gate (AMG) [R]

[Project] Post-hoc Adaptive MoE Gating on Qwen3.6-35B — empirical benchmarking of an open research gap Adaptive MoE routing — selecting a variable number of experts per token based on routing confidence — has been studied in papers (XMoE 2024, DynMoE ICLR 2025, TopP routing Huang…

Read full story →
NewsReddit r/MachineLearningLive · 4d ago

I do historical swordfighting and noticed AI struggles to track it. I’m building an open dataset to help fix this. Does my schema make sense? [P]

Hi everyone, I’m a historical swordfighter (HEMA practitioner), and while I’m not a computer vision engineer or a roboticist, I’ve been reading a lot about the current bottlenecks in embodied AI, specifically around the Sim2Real gap and thin-object tracking. It occurred to me tha…

Read full story →
NewsReddit r/MachineLearningLive · 4d ago

Cerebras OpenAI deal capacity has effectively killed the waitlist for everyone else [D]

I’m pretty annoyed. We’re a small AI startup building a real-time coding agent. Our p95 latency requirements are tight (and self imposed, but thats the product). We need sustained high-throughput inference with ~1-2k tokens/second. Been on the Cerebras waitlist for months trying…

Read full story →
NewsReddit r/MachineLearningLive · 4d ago

RAGless: Q-Q retrieval with score aggregation for closed-domain FAQ [P]

What it does RAGless is a semantic retrieval system based on Question-to-Question matching. At ingestion, an LLM generates multiple question variants per answer (3–5) and each variant gets its own embedding. At query time, the user question is embedded, Top-K nearest question var…

Read full story →
NewsReddit r/MachineLearningLive · 4d ago

ECCV 2026 Final Decisions after Provisional Acceptance [D]

Has anyone actually received final acceptance following their provisional acceptance email from ECCV 2026? I am very confused. Thank you so much. submitted by /u/Land_Heavy [link]

Read full story →
NewsReddit r/MachineLearningLive · 4d ago

[D] Looking for people serious about ML, DL & DSA 🚀[D]

I recently started a Telegram community called The Daily Commit. The goal is simple: stay consistent and hold each other accountable. What we do: 🧠 Share what we learned every day. ❓ Discuss ML, DL & DSA doubts. 📚 Share quality resources. 🚀 Build projects together. 💪 Stay con…

Read full story →
NewsReddit r/MachineLearningLive · 4d ago

Building an AI loan document intelligence platform — shipped fraud detection, hybrid search, and 3-tier email fallback this week[P]

What shipped this week: Real fraud detection (8 heuristic rules) Replaced a no-op stub with actual signal detection: FR-001: Unredacted SSN in document body FR-002: Duplicate content hash (same doc resubmitted under a new name) FR-003: Implausibly round income figures ($50,000.00…

Read full story →
NewsReddit r/MachineLearningLive · 5d ago

Evaluating long-term memory limits in stateless LLM chatbots — feedback needed [D]

Hi all, I’m working on a research project exploring how stateless LLM-based chatbots handle long conversations and whether important earlier information is still reliably retained over time. My idea is to: Run a chatbot using an LLM API without any external memory system Introduc…

Read full story →
NewsReddit r/MachineLearningLive · 5d ago

NagaTranslate: Building a translation and voice pipeline for low-resource Nagaland creoles (Whisper, VITS, LLMs) [P]

Read full story →
NewsReddit r/MachineLearningLive · 6d ago

Benchmarking Self-Hosted Gemma 2 9B vs. Frontier APIs: The FP8 Quantization Prefill Tax and VRAM Realities on an NVIDIA L4 [P]

When evaluating migrating production LLM workloads off commercial cloud APIs, the conversation usually gets oversimplified into a trade-off between quality and infrastructure cost. To look past clean, isolated averages, I built a repeatable evaluation matrix using a real-world wo…

Read full story →
NewsReddit r/MachineLearningLive · 6d ago

Do we still need to study algorithms now that AI writes most of our code? [D]

I've been thinking about this for a while. AI can now write functions, explain code, refactor projects, generate tests, and even solve many programming problems better than many junior developers. I've also noticed that Stack Overflow seems far less active than it used to be beca…

Read full story →
NewsReddit r/MachineLearningLive · 6d ago

Showcase: Building ML models that "watch" MMA fights and label events and positional changes making these moments all searchable on a timeline [P]

Hey all, a bit of background - I'm an ex Amateur MMA fighter and BJJ brown belt and am also in the AI/ML space ... weird combo but wanted to know if anyone else was at the intersection of ML/AI and MMA/BJJ. In short, I'm building AI models that "watch" fights and are able to dete…

Read full story →
NewsReddit r/MachineLearningLive · 6d ago

Kicking off GPU Mode [D]

Hey ! I’m starting a series to document my work on GPU infrastructure, LLMs, and CV. Stop #1 is up: A brief look at why GPUs are the center of the industry, the CPU/GPU divide, and why nvidia-smi is the first place you check when things break. We’ll move past the basics quickly t…

Read full story →
NewsReddit r/MachineLearningLive · 6d ago

I silently break training codes or configs so I made pybench [P]

It is like pytest but for statistical tests: it ensures no regression of your metrics at a statistical level. It manages tedious things such that seeds, past benchmark results, ... Simple CLI working like pytest but with benchmarks/ directory instead of tests/: pybench # 1st time…

Read full story →
NewsReddit r/MachineLearningLive · 7d ago

Roast my 3-year roadmap: Pivoting from Python/BaaS to AI Infrastructure & Go (Graduating 2029) [D]

I'm a B.Tech student in India graduating in mid-2029. Currently, I know Python, SQL, Docker, basic prompt engineering, and I've built a few LLM apps using BaaS like Supabase/Firebase. I’m running all this on an Intel i5 13th Gen laptop with an RTX 5050 (8GB VRAM). The Pivot: I or…

Read full story →
NewsReddit r/MachineLearningLive · 7d ago

What if context compression is a diffusion noise function? Proposal + honest results from untrained-model experiments [R]

I'm proposing a way to handle massive context longer than a model's context window by treating semantic compression as the noise function of a diffusion-like process. Instead of denoising masked tokens into coherent text (like DiffusionGemma or Nemotron-Diffusion do for generatio…

Read full story →
NewsReddit r/MachineLearningLive · 7d ago

A debugger for RL reward functions that detects reward hacking during training [P]

While ex

Read full story →
NewsReddit r/MachineLearningLive · 7d ago

Live Continual Learning in Machine Learning [D]

My question on live continual learning use cases was removed by moderators here because they think i asked basic level question about live continual learning which i thought is a frontier level research. But anyways. Is anyone interested in talking about continual learning (live)…

Read full story →
NewsReddit r/MachineLearningLive · 7d ago

Could it be that there aren’t really any medical LLM APIs available right now? [D]

As part of my ablations, I want to generate text with a medical-oriented LLM, and I was surprised to find no exposed APIs for this kind of model. I found models like MedGemma and BioMistral on Hugging Face, but they don’t seem to offer public APIs, and I really don’t want to host…

Read full story →
NewsReddit r/MachineLearningLive · 7d ago

I compiled LLM inference pricing across 7 providers — the caching numbers are surprising(spreadsheet included) [R]

Read full story →
NewsReddit r/MachineLearningLive · 7d ago

Would having a dedicated programming language specifically for LLMs be a viable solution? [D]

What if there was a new programming language where the meaning of each token was so dense (or perhaps so specific) that an LLM could write robust code with fewer tokens and faster inference? Assuming there’s enough training data, do you think something like this allow an LLM to w…

Read full story →
NewsReddit r/MachineLearningLive · 7d ago

MuJoCo derived Simulator for High Fidelity Vision RL training natively on GPU [D]

Hi everyone, For the past couple of weeks I have been working on a simulator project considering the shortcomings of MuJoCo. There are things that people like and also don't like about MuJoCo, like the CPU dependency on MuJoCo which makes the simulation not parallelizable beyond…

Read full story →
NewsReddit r/MachineLearningLive · 7d ago

High Dimensional, Dynamic Rotary Positional Embedding [P]

At the end

Read full story →
NewsReddit r/MachineLearningLive · 7d ago

Find the best open-source OCR models in one place at Papers with Code [P]

Hi, I've created an overview of the most important OCR benchmarks, along with the top open models, and links to their paper and code: https://paperswithcode.co/tasks/ocr . This week, new OCR models were released by Baidu and Mistral. Baidu released Unlimited OCR , a 3B-parameter…

Read full story →
NewsReddit r/MachineLearningLive · 7d ago

Kuma: compiling PyTorch models into self-contained WebGPU executables [P]

I've been experimenting with a compiler/runtime project that I'm not entirely sure is a good idea, so I'd love some feedback from people who've worked on deployment systems. The idea is to compile an exported PyTorch model into a self-contained package that contains: graph binary…

Read full story →
NewsReddit r/MachineLearningLive · 7d ago

Does ML background help or hurt when applying for security roles [D]

Worried recruiters see "ML/AI engineer" on a resume and assume zero security depth, even with real hands on work in the space. Anyone hired into security from a non-traditional background like this — how'd you frame it? submitted by /u/Xorphian

Read full story →
NewsReddit r/MachineLearningLive · 7d ago

CALHippo - Mapping neurons and glial cells in the human brain hippocampus in 3D using SOTA segmentation and density estimation models [R]

Read full story →
NewsReddit r/MachineLearningLive · 7d ago

How're you deploying LLMs in production now-a-days? What's the best and most affordable way? [D]

I've been developing an AI product using LLM APIs (from OpenRouter) but want to deploy an open-source LLM in my own Prod env. which I can control. Few reasons behind this are: - I wanna own the complete stack around my product. - Second I wanna fine-tune the model around my useca…

Read full story →
NewsReddit r/MachineLearningLive · 7d ago

Showcase: geolocating a dashcam video without GPS, only from the footage [P]

Sharing a project I have been working on called Third Eye. It does visual geolocation. Given a video, it figures out where it was filmed using only the image content, and draws the route on a map. Pipeline in short: per frame place recognition against a street imagery index a tra…

Read full story →

From the graph · 3

Related topics