#memory-management

#kubernetes
from Medium
2 weeks ago
DevOps

Why Swap on Kubernetes Isn't the Same as Swap on Linux - and What You Should Do Instead

Swap behaves differently in Kubernetes than in traditional Linux server environments.
Enabling swap alone does not prevent OOMKills or pod evictions in Kubernetes environments.
Kubernetes handles memory management and swap differently than traditional Linux, leading to potential failures when using swap.
#software-development
from Pycoders
3 months ago
Python

PyCoder's Weekly | Issue #665

MindsDB streamlines the creation of AI agents by overcoming challenges related to legacy data systems and complex queries.
from Hackernoon
1 month ago
Miscellaneous

What You Have to Know About the New Features in Go 1.24: Cleanups and Weak Pointers | HackerNoon

Go 1.24 introduces runtime.AddCleanup and weak.Pointer for enhanced garbage collection management.
#weakmap
from Jlongster
1 month ago
Scala

Subverting control with weak references

Weak references enhance memory management in JavaScript through constructs like WeakMap and WeakRef.
from 2ality
11 months ago
JavaScript

ECMAScript proposal: Symbols as WeakMap keys

Symbols as WeakMap keys allow non-mutating attachment of data, preventing memory leaks.
#programming-languages
Scala
from InfoWorld
2 months ago

TrapC proposal to fix C/C++ memory safety

TrapC enhances C's capabilities by improving safety and memory management while maintaining C's minimalist design.
from InfoQ
3 months ago
JavaScript

Modeling Complex Data Structure in Golang Using Pointers, References, and Reverse Indices

Go pointers and references are essential for building efficient systems in the Go language.
Understanding pointers can be complicated, but practical projects can aid learning.
from Hackernoon
10 months ago
JavaScript

A Simplified Comparison: Rust and Pointers | HackerNoon

Rust ensures memory safety through its unique ownership and borrowing model, mitigating risks present in traditional languages.
from Realpython
3 months ago
JavaScript

How to Split a Python List or Iterable Into Chunks - Real Python

Splitting a long list into fixed-size chunks can enhance performance and manageability in programming and data transfer.
#software-engineering
from InfoQ
4 months ago
JavaScript

Challenges of Creating iOS App Extensions at Lyft

Lyft engineers efficiently manage iOS app extension development by optimizing dependencies, binary size, and memory usage while adhering to Apple's constraints.
from Hackernoon
3 months ago
JavaScript

RRR-Rust, RAR, and RAC: The Trifecta for Co-Design Excellence | HackerNoon

Rust offers improved type safety and memory management through its ownership model.
Restricted Algorithmic Rust serves as a bridge from C/C++ to RAC while maintaining Rust's proper syntax.
JavaScript
from Hackernoon
3 months ago

How I Solved the WebSocket Scaling Problem Without Breaking the Bank | HackerNoon

WebSockets enable scalable systems for handling many connections efficiently with proper connection management.
#pagedattention
from Hackernoon
1 year ago
Miscellaneous

Evaluating vLLM With Basic Sampling | HackerNoon

vLLM sustains higher request rates than other LLM serving systems while maintaining low latencies, thanks to efficient memory management.
from Hackernoon
1 year ago
Miscellaneous

How vLLM Implements Decoding Algorithms | HackerNoon

vLLM optimizes large language model serving through innovative memory management and GPU techniques.
from Hackernoon
1 year ago
Miscellaneous

PagedAttention: An Attention Algorithm Inspired By the Classical Virtual Memory in Operating Systems | HackerNoon

PagedAttention optimizes memory usage in language model serving, significantly improving throughput while minimizing KV cache waste.
from Hackernoon
1 year ago
Miscellaneous

How Good Is PagedAttention at Memory Sharing? | HackerNoon

Memory sharing in PagedAttention enhances efficiency in LLMs, significantly reducing memory usage during sampling and decoding processes.
from Hackernoon
1 year ago
Miscellaneous

Our Method for Developing PagedAttention | HackerNoon

PagedAttention optimizes memory usage in LLM serving by managing key-value pairs in a non-contiguous manner.
from Hackernoon
1 year ago
Miscellaneous

Decoding With PagedAttention and vLLM | HackerNoon

vLLM optimizes memory management in LLM decoding by reserving only necessary resources, improving efficiency and performance.
#large-language-models
from Hackernoon
6 months ago
Miscellaneous

Practical LLMs for Real-World Applications | HackerNoon

Anchor-based LLMs reduce memory use by 99% while improving inference speed by up to 3.5 times, enabling practical use on resource-constrained devices.
from Hackernoon
1 year ago
Miscellaneous

How Effective is vLLM When a Prefix Is Thrown Into the Mix? | HackerNoon

vLLM significantly improves throughput in LLM tasks by utilizing shared prefixes among different input prompts.
#transformer-models
from Hackernoon
1 year ago
Data science

Evaluating the Performance of vLLM: How Did It Do? | HackerNoon

vLLM was tested using various Transformer-based large language models to evaluate its performance under load.
from Hackernoon
1 year ago
Miscellaneous

The Generation and Serving Procedures of Typical LLMs: A Quick Explanation | HackerNoon

Transformer-based language models use autoregressive approaches for token sequence probability modeling.
from Hackernoon
1 year ago
Miscellaneous

Batching Techniques for LLMs | HackerNoon

Batching improves compute utilization for LLMs, but naive strategies can cause delays and waste resources. Fine-grained batching techniques offer a solution.
#vllm
from Hackernoon
1 year ago
Miscellaneous

The Distributed Execution of vLLM | HackerNoon

Large Language Models often exceed single GPU limits, requiring advanced distributed execution techniques for memory management.
from Hackernoon
1 year ago
Miscellaneous

How vLLM Prioritizes a Subset of Requests | HackerNoon

vLLM utilizes FCFS scheduling and an all-or-nothing eviction policy to effectively manage resources and prioritize fairness in request handling.
#deep-learning
from Hackernoon
1 year ago
JavaScript

Pytorch Contiguous Tensor Optimization | HackerNoon

Efficient memory management and tensor contiguity are essential for optimizing performance in PyTorch, especially when handling large-scale datasets.
from Hackernoon
1 year ago
Miscellaneous

PagedAttention: Memory Management in Existing Systems | HackerNoon

Current LLM serving systems manage memory inefficiently, wasting a significant share of it on fixed-size allocations sized for the maximum possible sequence length.
#app-compatibility
from InfoQ
8 months ago
Web design

Android 15 Beta 4 Now Available for Developers to Bring their Apps Up to Date

The final Android 15 beta focuses on stable APIs for developer use and introduces significant behavior changes and new privacy features.
from Developer Tech News
4 months ago
Web design

Google urges Android developers to prep for 16 KB memory page

Android developers must prepare for a 16 KB memory page size upgrade to gain performance benefits of 5-10% in apps and games.
from Medium
5 months ago
JavaScript

How We Saved 12% in Resources with Smarter Heap Management

Memory issues in a Scala service were traced to buffer mismanagement, leading to high memory usage and frequent GC runs.
Optimized buffer sizes and investigated long-lived object handling to address memory challenges.
from Hackernoon
1 year ago
Miscellaneous

LLM Service & Autoregressive Generation: What This Means | HackerNoon

LLMs generate tokens sequentially, relying on cached key and value vectors from prior tokens for efficient autoregressive generation.
from LogRocket Blog
5 months ago
JavaScript

Handling memory leaks in Rust - LogRocket Blog

Rust's ownership and borrowing principles help manage memory, but memory leaks can still occur, so developers must manage memory carefully.
from InfoQ
5 months ago
JavaScript

Java 24 to Reduce Object Header Size and Save Memory

JEP 450 optimizes Java heap management by implementing compact object headers, which reduces header size and improves memory efficiency.
from Hackernoon
3 years ago
JavaScript

Chatbot Memory: Implement Your Own Algorithm From Scratch | HackerNoon

Implementing effective memory management is crucial for chatbot development, ensuring fluid and coherent interactions during long conversations.
Digital life
from The Verge
6 months ago

Chrome introduces new 'Performance' tools to wrangle the tabs gobbling up your memory

Google introduces new Chrome features for better tab management through performance alerts and enhanced Memory Saver options.
from Vrplumber
7 months ago
JavaScript

Plumbing Life's Depths - Interesting Memory Leak with Python 3.12 for PyOpenGL-accelerate

The PyOpenGL-accelerate test suite's memory leak test fails only on the 76th iteration under Python 3.12, suggesting changes in Python's memory management.
from Medium
8 months ago
JavaScript

Improve the performance of your Java application by using these optimizations

Optimize string concatenation with StringBuilder or StringBuffer.
Use local variables for frequently accessed data.
Optimize loops by moving invariant calculations outside.
Use switch statements for better performance.
#data-structures
from Hackernoon
1 year ago
JavaScript

Augmented Linked Lists: An Essential Guide | HackerNoon

Linked lists allow fast addition of data without resizing an entire array, which makes them suitable for write-only data and for organizing data for sequential reads.
from Hackernoon
1 year ago
JavaScript

Augmented Tree Data Structures | HackerNoon

Data structures are key to efficient data storage and organization, crucial for memory management and optimizing software performance.
from Alvinalexander
3 years ago
Scala

How to control Java heap size (memory) allocation (xmx, xms)

Use -Xmx to specify maximum heap size and control RAM use in Java programs.