The 2026 State of Kubernetes Optimization Report from CAST AI analyzes trends in Kubernetes cost management, efficiency, and resource utilization based on data from thousands of clusters, highlighting common challenges and best practices for optimizing cloud-native infrastructure.
#optimization
30 items
kdts is an optimization-first TypeScript compiler that uses types throughout compilation for aggressive transformations. It supports Bun, with a fast mode wrapping bun build and an opt mode using Google Closure Compiler for type-driven optimizations on accurately typed code.
The video explores techniques for improving the performance of interpreters, including bytecode optimization, just-in-time compilation, and other methods used to make interpreted languages run faster.
Gitperf.com provides tools and resources for optimizing Git performance, focusing on improving repository speed and efficiency for developers working with large codebases. The site offers performance analysis and optimization techniques for Git workflows.
The article discusses how technological constraints have shifted from hardware limitations to software development challenges. It examines how this bottleneck movement affects innovation and productivity across various industries.
The article explains how incident.io used Bloom filters, a probabilistic data structure, to achieve a 16× speed improvement in their API for checking user permissions across many teams, by avoiding expensive database lookups and reducing latency significantly.
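The post's exact implementation isn't shown here, but the core idea can be sketched with a toy Bloom filter (the hash scheme, bit-array size, and key format below are arbitrary illustrative choices, not incident.io's):

```python
import hashlib

class BloomFilter:
    """Toy Bloom filter: may report false positives, never false negatives."""
    def __init__(self, size_bits=1024, num_hashes=4):
        self.size = size_bits
        self.num_hashes = num_hashes
        self.bits = 0  # a big int used as a bit array

    def _positions(self, item):
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.size

    def add(self, item):
        for pos in self._positions(item):
            self.bits |= 1 << pos

    def might_contain(self, item):
        return all(self.bits >> pos & 1 for pos in self._positions(item))

# Cache each user's team memberships in a filter: a negative answer skips
# the database entirely; only a positive answer needs a verifying query.
perms = BloomFilter()
perms.add("user42:team-alpha")
print(perms.might_contain("user42:team-alpha"))  # True
```

The win comes from the asymmetry: "definitely not a member" is answered in memory, so only the rare possible-positive path pays for a database round trip.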
XOR'ing a register with itself is a common assembly idiom for zeroing it out: it is shorter than loading an immediate zero, and modern x86 processors recognize it as a dependency-breaking zeroing idiom. The article explains why XOR, rather than the equally short SUB instruction, became the preferred convention.
XORing a register with itself is a common assembly language technique that sets the register to zero. This approach is often more efficient than loading an immediate zero because it requires fewer bytes to encode and executes faster on many processors.
A developer created an event system with a 6-instruction hot loop that turned out to be exceptionally fast. The system uses a simple design that minimizes overhead and maximizes performance for event handling.
The document presents size-optimized implementations of ECDSA (Elliptic Curve Digital Signature Algorithm) focusing on minimizing code size for constrained environments. It discusses mathematical foundations and implementation techniques for efficient elliptic curve cryptography on resource-limited devices.
The author shares their experience optimizing an Elixir codebase, detailing specific techniques and approaches used to improve performance. They discuss various optimization strategies that may be rarely needed but can be valuable in certain scenarios.
The article discusses how certain optimizations in merge sort algorithms can actually degrade performance rather than improve it, using multi-merge sort as an example where theoretical improvements don't translate to practical gains.
The article discusses why XOR became the most popular idiom for zeroing out a register in assembly programming, rather than using subtraction. It examines the historical and technical reasons behind this convention.
The article demonstrates how to implement standard C string.h functions using x86-64 assembly language string instructions. It provides practical examples of writing optimized string manipulation routines with assembly-level control.
The article examines modern culling techniques used in rendering pipelines to improve performance by eliminating unnecessary geometry processing. It discusses various approaches including frustum culling, occlusion culling, and hierarchical methods that work together to optimize rendering efficiency.
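The simplest of these, frustum culling, can be sketched in a few lines: test each object's bounding sphere against six inward-facing planes and skip anything fully outside. The plane representation and the toy axis-aligned "frustum" below are assumptions for the demo, not taken from the article:

```python
# Each plane is (nx, ny, nz, d) with an inward-facing unit normal:
# a point p is inside the plane's half-space when dot(n, p) + d >= 0.

def sphere_in_frustum(center, radius, planes):
    """Conservative test: returns False only when the sphere is fully outside."""
    cx, cy, cz = center
    for nx, ny, nz, d in planes:
        if nx * cx + ny * cy + nz * cz + d < -radius:
            return False  # entirely behind this plane -> cull the object
    return True

# A toy stand-in for a frustum: the cube [-1, 1]^3 expressed as six planes.
cube = [
    ( 1, 0, 0, 1), (-1, 0, 0, 1),
    ( 0, 1, 0, 1), ( 0,-1, 0, 1),
    ( 0, 0, 1, 1), ( 0, 0,-1, 1),
]
print(sphere_in_frustum((0, 0, 0), 0.5, cube))   # True: inside, keep
print(sphere_in_frustum((5, 0, 0), 0.5, cube))   # False: outside, cull
```

Occlusion culling and hierarchical methods build on the same principle, rejecting whole groups of objects before any per-triangle work happens.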
Going loopy
The article examines how compilers and optimizers handle loop constructs in programming, analyzing various optimization techniques applied to iterative code structures.
Compilers can optimize loops by transforming induction variables to eliminate expensive calculations. This optimization technique improves performance by simplifying loop computations through mathematical analysis.
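The classic instance of this, strength reduction of induction variables, can be imitated by hand; a minimal Python sketch (the names and the address-computation scenario are illustrative):

```python
def addresses_naive(base, stride, n):
    # Each iteration recomputes base + i * stride: one multiply per element.
    return [base + i * stride for i in range(n)]

def addresses_reduced(base, stride, n):
    # Strength reduction: the per-iteration multiply becomes a running add.
    out, addr = [], base
    for _ in range(n):
        out.append(addr)
        addr += stride
    return out

assert addresses_naive(0x1000, 8, 4) == addresses_reduced(0x1000, 8, 4)
```

Optimizing compilers perform this rewrite automatically once they prove the index is an induction variable (it changes by a fixed amount each iteration).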
Compilers can optimize code by using specific CPU instructions for population count operations. This article examines how compilers leverage specialized hardware instructions to efficiently count set bits in data.
Aliasing
The article discusses aliasing in programming, explaining when compilers cannot optimize code due to potential memory overlaps. Understanding these limitations helps developers write more efficient code.
Understanding compiler calling conventions can aid in software design and optimization. The article examines how different calling conventions affect function argument passing and performance.
Floating point arithmetic lacks the associativity property that integer operations have, which prevents automatic SIMD vectorization by compilers. This article explains why this occurs and discusses potential solutions to enable vectorization of floating point code.
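The non-associativity is easy to demonstrate; the classic example in Python:

```python
a, b, c = 0.1, 0.2, 0.3
left = (a + b) + c
right = a + (b + c)
print(left == right)  # False: rounding differs between the two orders
print(left, right)

# Vectorizing a reduction reorders the additions (partial sums per SIMD
# lane), which can change the rounded result -- so compilers refuse to do
# it unless told reassociation is acceptable (e.g. -ffast-math,
# -fassociative-math, or explicit OpenMP reduction clauses).
```

Because the two groupings round differently, a compiler cannot prove a vectorized sum produces bit-identical results to the scalar loop, which is exactly the barrier the article describes.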
The article discusses rendering optimizations that occur naturally in React applications, suggesting developers consider simpler approaches before reaching for memoization techniques. It explores how certain optimizations happen automatically through React's design patterns.
Exposing raw pointers makes optimization difficult for compilers, since almost any memory location could be read or written through them. High-level languages impose constraints that limit possible program behavior, which in turn enables more aggressive optimizations that remain sound.
The article examines weight decay as a regularization technique in training a GPT-2 small model from scratch. It explains that weight decay adds a penalty based on the squared L2 norm of model weights to the loss function to prevent overfitting. The author explores the mathematical formulation and its implementation in the AdamW optimizer.
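The decoupled update that distinguishes AdamW from classic L2 regularization can be sketched for a single scalar weight in plain Python (hyperparameter values are common defaults, not necessarily the article's):

```python
def adamw_step(w, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999,
               eps=1e-8, weight_decay=0.01):
    """One AdamW update for a single scalar weight.

    Weight decay is applied directly to the weight (decoupled) rather than
    folded into the gradient, as it would be with an L2 penalty in the loss.
    """
    m = beta1 * m + (1 - beta1) * grad          # first-moment estimate
    v = beta2 * v + (1 - beta2) * grad * grad   # second-moment estimate
    m_hat = m / (1 - beta1 ** t)                # bias correction
    v_hat = v / (1 - beta2 ** t)
    w = w - lr * m_hat / (v_hat ** 0.5 + eps)   # Adam step
    w = w - lr * weight_decay * w               # decoupled decay term
    return w, m, v

w, m, v = 1.0, 0.0, 0.0
w, m, v = adamw_step(w, grad=0.5, m=m, v=v, t=1)
```

With a plain L2 penalty the decay term would pass through Adam's adaptive scaling; decoupling it (as above) shrinks every weight by the same relative amount regardless of its gradient history.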
The author explores learning rate scheduling for training an LLM from scratch, examining why fixed learning rates can fail and discussing various decay methods including step, exponential, and cosine decay. The post focuses on implementing a cosine learning rate scheduler with warmup, following recommendations from the Chinchilla paper.
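One common formulation of warmup followed by cosine decay, sketched in Python (the parameter names and exact shape are assumptions, not necessarily the author's implementation):

```python
import math

def cosine_lr(step, max_steps, warmup_steps, max_lr, min_lr):
    """Linear warmup to max_lr, then cosine decay down to min_lr."""
    if step < warmup_steps:
        # Linear ramp: reaches max_lr on the last warmup step.
        return max_lr * (step + 1) / warmup_steps
    # Cosine decay over the remaining steps: progress goes 0 -> 1.
    progress = (step - warmup_steps) / (max_steps - warmup_steps)
    return min_lr + 0.5 * (max_lr - min_lr) * (1 + math.cos(math.pi * progress))
```

The warmup phase avoids large, noisy updates while Adam's moment estimates are still unreliable; the cosine tail then anneals the rate smoothly toward `min_lr` instead of dropping it in abrupt steps.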
FastDoom achieves its speed through optimized rendering techniques, including using lookup tables for trigonometric calculations and implementing a more efficient BSP traversal algorithm. These technical improvements significantly reduce computational overhead while maintaining visual fidelity.
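Doom-era engines stored such tables in fixed-point; as a simplified floating-point sketch of the lookup-table idea (table size is an arbitrary choice here):

```python
import math

TABLE_SIZE = 1024
# Precompute one full period of sine at table build time.
SINE_LUT = [math.sin(2 * math.pi * i / TABLE_SIZE) for i in range(TABLE_SIZE)]

def fast_sin(angle):
    """Approximate sin(angle) with a nearest-entry table lookup."""
    index = round(angle / (2 * math.pi) * TABLE_SIZE) % TABLE_SIZE
    return SINE_LUT[index]

# Nearest-neighbour lookup error is bounded by roughly pi / TABLE_SIZE.
assert abs(fast_sin(1.0) - math.sin(1.0)) < 2 * math.pi / TABLE_SIZE
```

The trade is memory for arithmetic: a table read replaces a transcendental evaluation, which mattered enormously on CPUs without fast floating-point hardware.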
Michael Abrash achieved significant performance improvements in Quake by optimizing assembly code, including using integer math instead of floating-point operations and implementing specialized routines for critical game functions.
The author details scaling HNSWs for Redis Vector Sets, including 8-bit quantization for memory efficiency, threaded operations for performance, bidirectional linking for proper deletion, and exposing HNSWs as composable data structures. The implementation achieves 50k ops/sec with 3GB RAM for 3 million Word2Vec entries.
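This is not the Redis implementation, but symmetric 8-bit quantization in its simplest form, storing one scale per vector plus int8 codes, can be sketched as:

```python
def quantize_int8(vec):
    """Symmetric per-vector int8 quantization: store a scale + int8 codes."""
    scale = max(abs(x) for x in vec) / 127 or 1.0  # avoid /0 on zero vectors
    codes = [round(x / scale) for x in vec]        # each in [-127, 127]
    return scale, codes

def dequantize(scale, codes):
    return [c * scale for c in codes]

scale, codes = quantize_int8([0.12, -0.8, 0.33])
restored = dequantize(scale, codes)
# Reconstruction error per component is at most half a quantization step.
assert all(abs(a - b) <= scale / 2 + 1e-12
           for a, b in zip([0.12, -0.8, 0.33], restored))
```

Storing 1 byte per dimension instead of 4 is what makes millions of embedding vectors fit in a few gigabytes of RAM, at the cost of a small, bounded loss in distance precision.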
The article explores how optimizing compilers track instruction side effects to enable optimizations like dead code elimination and instruction reordering. It examines two main approaches: Cinder's bitset-based representation and JavaScriptCore's hierarchical abstract heap system. These effect tracking systems help compilers determine when operations can be safely reordered or eliminated.
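A toy version of the bitset approach (the opcodes and effect bits here are hypothetical stand-ins, not Cinder's or JavaScriptCore's actual sets):

```python
from enum import IntFlag

class Effect(IntFlag):
    NONE        = 0
    READS_HEAP  = 1
    WRITES_HEAP = 2
    SIDE_EXIT   = 4  # may raise, deoptimize, or otherwise leave the trace

# Per-opcode effect summaries (hypothetical opcodes).
EFFECTS = {
    "load_const":  Effect.NONE,
    "load_field":  Effect.READS_HEAP,
    "store_field": Effect.WRITES_HEAP,
    "call":        Effect.READS_HEAP | Effect.WRITES_HEAP | Effect.SIDE_EXIT,
}

def can_reorder(op_a, op_b):
    """Two ops may swap when neither writes state the other reads or writes."""
    a, b = EFFECTS[op_a], EFFECTS[op_b]
    if (a | b) & Effect.SIDE_EXIT:
        return False  # control-flow effects pin the instruction in place
    if a & Effect.WRITES_HEAP and b & (Effect.READS_HEAP | Effect.WRITES_HEAP):
        return False
    if b & Effect.WRITES_HEAP and a & (Effect.READS_HEAP | Effect.READS_HEAP | Effect.WRITES_HEAP):
        return False
    return True

print(can_reorder("load_const", "load_field"))   # True: no conflicting state
print(can_reorder("store_field", "load_field"))  # False: write/read conflict
```

The same summaries drive dead code elimination: an instruction whose effects are `NONE` and whose result is unused can simply be dropped. Abstract heaps, the hierarchical alternative, refine this by splitting "the heap" into named regions so that writes to one region don't block reads from another.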
Value numbering is a compiler optimization technique that identifies instructions known at compile-time to always produce the same value at run-time. It extends beyond static single assignment (SSA) form by enabling reuse of identical computations through hashing and mapping approaches.
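A minimal local value numbering pass over a toy three-address IR illustrates the hashing approach (the instruction format is invented for this sketch):

```python
def local_value_numbering(instructions):
    """instructions: (dest, op, arg1, arg2) tuples in SSA-like form.
    Returns a map from redundant dests to the earlier dest they duplicate."""
    table = {}    # (op, numbered operands) -> dest that first computed it
    number = {}   # variable -> its canonical value (here, a representative dest)
    replace = {}
    for dest, op, a1, a2 in instructions:
        key = (op, number.get(a1, a1), number.get(a2, a2))
        if op in ("add", "mul"):  # commutative: canonicalize operand order
            key = (op,) + tuple(sorted(key[1:]))
        if key in table:
            replace[dest] = table[key]  # same value already computed: reuse it
            number[dest] = table[key]
        else:
            table[key] = dest
            number[dest] = dest
    return replace

prog = [
    ("t1", "add", "a", "b"),
    ("t2", "add", "b", "a"),   # same value as t1 (commutativity)
    ("t3", "mul", "t2", "c"),
    ("t4", "mul", "t1", "c"),  # same value as t3, found via value numbers
]
print(local_value_numbering(prog))  # {'t2': 't1', 't4': 't3'}
```

Note that `t4` is caught even though its operand `t1` is textually different from `t3`'s operand `t2`: because both carry the same value number, the lookup key matches, which is exactly what pushes value numbering beyond purely syntactic common subexpression elimination over SSA names.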