Understanding INT4, INT8, FP16, BF16, and TF32 formats in machine learning - their precision, speed, and memory trade-offs for training and inference.

aiml math research

What do GPT-OSS and Gemma 3 really offer?

https://luminary.blog/techs/gptoss-and-gemma3 · 19 Aug 2025

GPT-OSS and Gemma 3: two new small-but-powerful language models pushing the boundaries.

aiml llm model research

What are Positional Embeddings?

https://luminary.blog/techs/positional-embeddings · 4 Aug 2025

The mathematical technique that teaches AI models where each word sits in a sequence.

aiml llm embeddings tokens

Words, Tokens and Embeddings

https://luminary.blog/techs/09-token-to-embedding · 31 Jul 2025

How language models convert token IDs into meaningful vector representations that capture semantic relationships.

aiml llm embeddings tokens

Subword Tokenization Algorithms

https://luminary.blog/techs/08-subword-tokenization-algorithms · 30 Jul 2025

Understanding the algorithms behind tokenization in Large Language Models.

aiml llm tokenization research

What is LLM Inference?

https://luminary.blog/techs/07-llm-inference · 29 Jul 2025

Understanding how Large Language Models generate text through the inference process.

aiml llm inference