Before Attention appeared, Encoder-Decoder models built on RNNs and LSTMs were the mainstream approach to sequence processing. This architecture splits the task into two parts. First, the Encoder encodes the input, in some way, into a fixed-length vector called the context vector. Second, the Decoder decodes the context vector, generating the target output sequence token by token. An RNN or LSTM feeds each step's output back in as the next step's input, looping until the sequence ends. But this is an inherently serial procedure, so performance suffers, and when the input is long, the fixed-length context vector struggles to capture all of the input information.
In 2014, to solve this context-vector problem, Dzmitry Bahdanau of the University of Montreal applied the Attention mechanism in NLP for the first time. This early form is known as Bahdanau Attention. Its premise is that the context vector should not be static. The original RNN uses hidden states to store memory, and the Encoder computes its hidden state as:
$$ h_t=\tanh (W_{encode}\cdot h_{t-1} + U_{encode}\cdot x_t + b) $$
This can be abbreviated as:
$$h_t = \text{RNN}(h_{t-1}, x_t)$$
Here $h_t$ is the hidden state at time $t$, and $x_t$ is the input at time $t$. The multiplications by $W$ and $U$ decide how much old memory and how much new information we keep. The Decoder likewise has a hidden state:
$$s_t = \tanh(W_{dec} \cdot s_{t-1} + U_{dec} \cdot [y_{t-1}, c_t] + b_{dec})$$
This can be abbreviated as:
$$s_t = \text{RNN}(s_{t-1}, y_{t-1}, c_t)$$
Here $c_t$ is the context vector. In a plain RNN this is just the Encoder's final hidden state $h_t$. But if the sequence is too long, the hidden state has to carry memory across too many steps, and information gets lost.
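To make this concrete, here is the encoder recurrence as a minimal NumPy sketch (sizes and weights are arbitrary); note that a plain RNN carries only the latest $h$ forward, which is exactly the limitation described above:

```python
import numpy as np

d_h, d_x = 4, 3                       # hidden and input sizes (arbitrary)
W = np.random.randn(d_h, d_h)         # weighs the previous hidden state
U = np.random.randn(d_h, d_x)         # weighs the current input
b = np.zeros(d_h)

def rnn_step(h_prev, x_t):
    """One encoder step: h_t = tanh(W h_{t-1} + U x_t + b)."""
    return np.tanh(W @ h_prev + U @ x_t + b)

h = np.zeros(d_h)                     # h_0
for x_t in np.random.randn(5, d_x):   # a toy 5-token input sequence
    h = rnn_step(h, x_t)              # only the latest h survives
# h is the fixed-length context vector a plain RNN hands to the Decoder
```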
Bahdanau Attention solves this as follows. First, instead of storing only a single $h$, it keeps every $h$ produced during encoding, stored together in a matrix. Second, it adds a small neural network in front of the Decoder: each time a token is to be emitted, it computes a score between the current decoding state (which summarizes what has been output so far) and every $h$:
$$score(s_{t-1}, h_j) = v^T \tanh(W s_{t-1} + U h_j)$$
Here $s_{t-1}$ is the Decoder's previous hidden state.
The scores are then normalized with a Softmax:
$$a_{tj} = \frac{\exp(score(s_{t-1}, h_j))}{\sum_k \exp(score(s_{t-1}, h_k))}$$
This gives the attention weight on each $h$. The context vector is then computed from these attention weights:
$$ c_t=\sum_{j}a_{tj}h_j $$
That is, a weighted sum over all the $h$. This context vector is handed to the Decoder, which generates the output and computes $s_t$.
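Putting the score, the softmax, and the weighted sum together, here is a minimal NumPy sketch of the attention computation at one decoding step (all weights random, shapes arbitrary):

```python
import numpy as np

d_h = d_s = 4                          # encoder/decoder hidden sizes (arbitrary)
hs = np.random.randn(6, d_h)           # all encoder hidden states h_1..h_6
s_prev = np.random.randn(d_s)          # decoder state s_{t-1}

Wa = np.random.randn(d_s, d_s)         # the small scoring network's weights
Ua = np.random.randn(d_s, d_h)
v = np.random.randn(d_s)

# score(s_{t-1}, h_j) = v^T tanh(W s_{t-1} + U h_j), one score per h_j
scores = np.array([v @ np.tanh(Wa @ s_prev + Ua @ h_j) for h_j in hs])

# softmax -> attention weights a_{tj}
a = np.exp(scores - scores.max())
a /= a.sum()

c_t = a @ hs                           # context vector: weighted sum of all h_j
```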
Because the score is formed by adding (projections of) $s$ and $h$ together inside the tanh, Bahdanau Attention is also known as additive attention.
In 2015, Minh-Thang Luong at Stanford improved the mechanism: instead of running a separate neural network to compute attention, simply use a dot product to measure similarity:
$$score(s_{t-1},h_j)=s_{t-1}^T\cdot h_j$$
The scores become plain matrix multiplications, which make full use of the GPU.
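In code, the scores for all encoder states collapse into a single matrix-vector product; this sketch assumes the encoder and decoder hidden sizes match so the dot product is defined:

```python
import numpy as np

d = 4
hs = np.random.randn(6, d)        # encoder states h_1..h_6
s_prev = np.random.randn(d)       # decoder state, same size as each h_j

scores = hs @ s_prev              # all scores s_{t-1}^T h_j in one matmul
a = np.exp(scores - scores.max())
a /= a.sum()                      # softmax, as before
c_t = a @ hs                      # context vector
```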
In 2017, Google published the famous paper Attention Is All You Need, which replaced the traditional RNN structure outright, using Attention alone. In the earlier designs, the RNN computed one $h$ per encoding step, and at each decoding step an attention score was computed against every $h$.
Attention Is All You Need argues that this roundabout way of computing attention is unnecessary. Instead it uses "self-attention": for each word in the input sequence, we directly compute its attention against every other word in the sequence.
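A minimal NumPy sketch of self-attention in the paper's scaled dot-product form, with learned query/key/value projections (random weights here; the $\sqrt{d_k}$ scaling is from the paper):

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Every token attends to every token: softmax(Q K^T / sqrt(d_k)) V."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    scores = Q @ K.T / np.sqrt(Q.shape[-1])      # (seq_len, seq_len)
    a = np.exp(scores - scores.max(axis=-1, keepdims=True))
    a /= a.sum(axis=-1, keepdims=True)           # row-wise softmax
    return a @ V                                 # new representation per token

seq_len, d_model = 5, 8
X = np.random.randn(seq_len, d_model)            # one embedding per input word
Wq, Wk, Wv = (np.random.randn(d_model, d_model) for _ in range(3))
out = self_attention(X, Wq, Wk, Wv)              # shape (5, 8)
```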
title: How to build a PDF Autofiller Agent?
tags: [agent, pdf]
categories: [agent]
date: 2026-01-15 17:15:00
index_img: /img/agent.png
cover: /img/agent.png
thumbnail: /img/agent.png
excerpt: Notes
Design a Copilot chatbox with the following functionality: the user uploads a PDF file that contains fields to fill in and gives the chatbox commands describing what to fill. The AI interprets each command, identifies the target fields and values, fills those fields with the values the user specified, and returns the completed form to the user.
| Tool | Printed Select | Printed Edit | Scanned Select | Scanned Edit | Comments |
|---|---|---|---|---|---|
| Adobe Acrobat PDF | ✅ | ❓ | ✅ | ❓ | Needs Pro subscription to edit |
| ABBYY FineReader PDF | | | | | Can’t install on Mac |
| PDFfiller | ✅ | ✅ | ❌ | ❌ | |
| LuminPDF | ✅ | ❓ | ✅ | ❓ | Needs Pro subscription to edit |
PDF form Template
Input: PDF raw data
The raw PDF stores each form field as a numbered object; a minimal text-field object looks roughly like this (field name and coordinates are illustrative):

```
12 0 obj
<< /FT /Tx /T (first_name) /Rect [100 650 300 670] >> endobj
```
Inputs and labels are not connected at the data-structure level: unlike HTML, where a label can be linked to an input by id, nothing in the PDF source ties them together. The only way to identify which label belongs to which input is to compare their coordinates.
Tool: pdf.js (parses the raw PDF in the browser).
Output: return a map between each object (input or label) and its coordinates.
For example (keys and coordinates illustrative):

```json
{ "t1": { "type": "input", "box": [320, 700, 480, 718] },
  "Date of Birth": { "type": "label", "box": [210, 702, 300, 716] } }
```
Given the coordinates of each object, find the matching ones; in particular, for every input field, find its matching label. Return the relationships as JSON.
Nested for-loops that compute the Euclidean distance between every input-label coordinate pair work well enough: each input is matched to its nearest label (the Python section below sketches the same logic).
Use the Vercel AI SDK to orchestrate the “Reasoning-Action” loop. The LLM does not modify the file directly; it acts as a router that decides which client-side tool to call.
- Tool: `ai` (Vercel AI SDK). The `useChat` hook intercepts the LLM’s tool call: when the LLM requests `fill_fields`, the browser executes the JavaScript logic to update the PDF.
- Tool: `pdf-lib` (client-side JavaScript). The PDF is held as a `Uint8Array` in memory. Text fields are filled with `form.getTextField(id).setText(value)`, checkboxes with `form.getCheckBox(id).check()`, followed by `form.updateFieldAppearances()` to ensure text is rendered visibly (generating the `/AP` stream).

The Python version works from the same input: PDF raw data (bytes). As in the JS version, inputs (widgets) and visual labels (text) are disconnected in the PDF structure, so we need to extract them separately.
Tool: PyMuPDF (import fitz)
Output: A map between each object and its coordinates.
```python
# Extracted using page.widgets() and page.get_text("words")
```
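A minimal extraction sketch under these assumptions (the file name is illustrative; `fitz` is PyMuPDF’s import name):

```python
import fitz  # PyMuPDF

doc = fitz.open("form.pdf")          # file name illustrative
page = doc[0]

# Inputs: AcroForm widgets, each with a field name and a bounding box
inputs = {w.field_name: tuple(w.rect) for w in page.widgets()}

# Labels: plain words with their coordinates
labels = [{"text": word, "box": (x0, y0, x1, y1)}
          for x0, y0, x1, y1, word, *_ in page.get_text("words")]
```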
Logic: Spatial Matching (Euclidean Distance). Given the coordinates of widgets and text blocks, match each widget to its nearest text block, as sketched below.
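A minimal matching sketch; the `inputs`/`labels` shapes follow the extraction sketch above, and the sample values are illustrative:

```python
import math

inputs = {"t1": (320, 700, 480, 718)}
labels = [{"text": "Date of Birth", "box": (210, 702, 300, 716)},
          {"text": "Full Name",     "box": (210, 652, 280, 666)}]

def center(box):
    x0, y0, x1, y1 = box
    return ((x0 + x1) / 2, (y0 + y1) / 2)

def nearest_label(box, labels):
    """Pick the label whose center is closest to the widget's center."""
    cx, cy = center(box)
    return min(labels, key=lambda l: math.hypot(center(l["box"])[0] - cx,
                                                center(l["box"])[1] - cy))

matches = [{"id": fid, "label": nearest_label(box, labels)["text"]}
           for fid, box in inputs.items()]
# -> [{"id": "t1", "label": "Date of Birth"}]
```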
Output: a map from `field_id` to `label_text` (e.g., `{"id": "t1", "label": "Date of Birth"}`).

Tool: LangChain + Pydantic. Use Pydantic to define a strict schema for the LLM output (structured output), replacing the need for raw prompt parsing.
Workflow:
```python
class FieldUpdate(BaseModel):
    ...
```
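A fuller sketch of this step, assuming an OpenAI chat model via `langchain-openai`; the model name, prompt, and field names are illustrative:

```python
from pydantic import BaseModel
from langchain_openai import ChatOpenAI

class FieldUpdate(BaseModel):
    field_id: str   # e.g. "t1"
    value: str      # text the user wants written into that field

class FormFill(BaseModel):
    updates: list[FieldUpdate]

# with_structured_output makes the model return a validated FormFill object
llm = ChatOpenAI(model="gpt-4o").with_structured_output(FormFill)

result = llm.invoke(
    "Known fields: t1 = Date of Birth, t2 = Full Name.\n"
    "User command: fill in my name, Jane Doe, born 1990-05-01."
)
# result.updates -> e.g. [FieldUpdate(field_id="t2", value="Jane Doe"), ...]
```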
Tool: pypdf
Action:
Load PDF bytes using PdfReader.
Map the LLM’s Pydantic output to a dictionary: { "field_id": "value" }.
Execute filling:
```python
writer.update_page_form_field_values(
    writer.pages[0], {"field_id": "value"})  # arguments illustrative
```
Return the BytesIO stream to the user.
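Putting the pypdf step together, a minimal sketch; the `fill_pdf` wrapper and the `/Annots` guard are illustrative choices, assuming the form consists of standard AcroForm fields:

```python
from io import BytesIO
from pypdf import PdfReader, PdfWriter

def fill_pdf(pdf_bytes: bytes, updates: dict[str, str]) -> BytesIO:
    """updates maps field_id -> value, e.g. {"t2": "Jane Doe"}."""
    reader = PdfReader(BytesIO(pdf_bytes))
    writer = PdfWriter()
    writer.append(reader)                    # copy pages and the AcroForm
    for page in writer.pages:
        if "/Annots" in page:                # only pages that carry widgets
            writer.update_page_form_field_values(page, updates)
    out = BytesIO()
    writer.write(out)
    out.seek(0)
    return out
```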