唐祖义 發表於 2023-10-16 00:00:00

Linux 工作队列 workqueue 是什么鬼?

<div id="navCategory"><h5 class="catalogue">目录</h5><ul class="first_class_ul"><li>
        目录</li><li>
        工作队列是什么</li><li>
        驱动程序</li></ul></div><p>
        <img title="Linux 工作队列 workqueue 是什么鬼?" alt="Linux 工作队列 workqueue 是什么鬼?" border="0" src="https://zhuji.jb51.net/uploads/img/202305/55d2269f5f3188a3268fd79aea1754a7.jpg"></p>
<p class="maodian"></p><h2>
        目录</h2>
<ul>
<li>
                工作队列是什么</li>
        <li>
                驱动程序</li>
        <li>
                编译、测试</li>
</ul>
<p>
        别人的经验,我们的阶梯!</p>
<p>
        Linux中断处理可用下图总结:</p>
<p>
        <img title="Linux 工作队列 workqueue 是什么鬼?" alt="Linux 工作队列 workqueue 是什么鬼?" border="0" src="https://zhuji.jb51.net/uploads/img/202305/8e6c4de5a7176c128ac7a1eebfd3f94e.jpg"></p>
<p>
        图中描述了中断处理中的下半部分都有哪些机制,以及如何根据实际的业务场景、限制条件来进行选择。</p>
<p>
        可以看出:这些不同的实现之间,有些是重复的,或者是相互取代的关系。</p>
<p>
        也正因为此,它们之间的使用方式几乎是大同小异,至少是在API接口函数的使用方式上,从使用这的角度来看,都是非常类似的。</p>
<p>
        这篇文章,我们就通过实际的代码操作,来演示一下工作队列(workqueue)的使用方式。</p>
<p class="maodian"></p><h2>
        工作队列是什么</h2>
<p>
        工作队列是Linux操作系统中,进行中断下半部分处理的重要方式!</p>
<p>
        从名称上可以猜到:一个工作队列就好像业务层常用的消息队列一样,里面存放着很多的工作项等待着被处理。</p>
<p>
        <img title="Linux 工作队列 workqueue 是什么鬼?" alt="Linux 工作队列 workqueue 是什么鬼?" border="0" src="https://zhuji.jb51.net/uploads/img/202305/44ec453c61f06019afe0ac97e749584c.jpg"></p>
<p>
        工作队列中有两个重要的结构体:工作队列(workqueue_struct) 和 工作项(work_struct):</p>
<ol class="dp-xml">
<li class="alt">
                <span><span>struct workqueue_struct { </span></span>
</li>
        <li class="alt">
                <span>struct list_head pwqs; /* WR: all pwqs of this wq */ </span>
</li>
        <li class="alt">
                <span>struct list_head list; /* PR: list of all workqueues */ </span>
</li>
        <li class="alt">
                <span>... </span>
</li>
        <li class="alt">
                <span>char name; /* I: workqueue name */ </span>
</li>
        <li class="alt">
                <span>... </span>
</li>
        <li class="alt">
                <span>/* hot fields used during command issue, aligned to cacheline */ </span>
</li>
        <li class="alt">
                <span>unsigned int flags ____cacheline_aligned; /* WQ: WQ_* flags */ </span>
</li>
        <li class="alt">
                <span>struct pool_workqueue __percpu *cpu_pwqs; /* I: per-cpu pwqs */ </span>
</li>
        <li class="alt">
                <span>struct pool_workqueue __rcu *numa_pwq_tbl[]; /* PWR: unbound pwqs indexed by node */ </span>
</li>
        <li class="alt">
                <span>}; </span>
</li>
</ol>
<ol class="dp-xml">
<li class="alt">
                <span><span>struct work_struct { </span></span>
</li>
        <li class="alt">
                <span>atomic_long_t data; </span>
</li>
        <li class="alt">
                <span>struct list_head entry; </span>
</li>
        <li class="alt">
                <span>work_func_t func; // 指向处理函数 </span>
</li>
        <li class="alt">
                <span>#ifdef CONFIG_LOCKDEP </span>
</li>
        <li>
                <span>struct lockdep_map lockdep_map; </span>
</li>
        <li class="alt">
                <span>#endif </span>
</li>
        <li class="alt">
                <span>}; </span>
</li>
</ol>
<p>
        在内核中,工作队列中的所有工作项,是通过链表串在一起的,并且等待着操作系统中的某个线程挨个取出来处理。</p>
<p>
        这些线程,可以是由驱动程序通过 kthread_create 创建的线程,也可以是由操作系统预先就创建好的线程。</p>
<p>
        这里就涉及到一个取舍的问题了。</p>
<p>
        如果我们的处理函数很简单,那么就没有必要创建一个单独的线程来处理了。</p>
<p>
        原因有二:</p>
<ol>
<li>
                创建一个内核线程是很耗费资源的,如果函数很简单,很快执行结束之后再关闭线程,太划不来了,得不偿失;</li>
        <li>
                如果每一个驱动程序编写者都毫无节制地创建内核线程,那么内核中将会存在大量不必要的线程,当然了本质上还是系统资源消耗和执行效率的问题;</li>
</ol>
<p>
        为了避免这种情况,于是操作系统就为我们预先创建好一些工作队列和内核线程。</p>
<p>
        我们只需要把需要处理的工作项,直接添加到这些预先创建好的工作队列中就可以了,它们就会被相应的内核线程取出来处理。</p>
<p>
        例如下面这些工作队列,就是内核默认创建的(include/linux/workqueue.h):</p>
<ol class="dp-xml">
<li class="alt">
                <span><span>/* </span></span>
</li>
        <li class="alt">
                <span>* System-wide workqueues which are always present. </span>
</li>
        <li class="alt">
                <span>* </span>
</li>
        <li class="alt">
                <span>* system_wq is the one used by schedule_work(). </span>
</li>
        <li class="alt">
                <span>* Multi-CPU multi-threaded. There are users which expect relatively </span>
</li>
        <li class="alt">
                <span>* short queue flush time. Don't queue works which can run for too </span>
</li>
        <li class="alt">
                <span>* long. </span>
</li>
        <li class="alt">
                <span>* </span>
</li>
        <li class="alt">
                <span>* system_highpri_wq is similar to system_wq but for work items which </span>
</li>
        <li class="alt">
                <span>* require WQ_HIGHPRI. </span>
</li>
        <li class="alt">
                <span>* </span>
</li>
        <li class="alt">
                <span>* system_long_wq is similar to system_wq but may host long running </span>
</li>
        <li class="alt">
                <span>* works. Queue flushing might take relatively long. </span>
</li>
        <li class="alt">
                <span>* </span>
</li>
        <li class="alt">
                <span>* system_unbound_wq is unbound workqueue. Workers are not bound to </span>
</li>
        <li class="alt">
                <span>* any specific CPU, not concurrency managed, and all queued works are </span>
</li>
        <li class="alt">
                <span>* executed immediately as long as max_active limit is not reached and </span>
</li>
        <li class="alt">
                <span>* resources are available. </span>
</li>
        <li class="alt">
                <span>* </span>
</li>
        <li class="alt">
                <span>* system_freezable_wq is equivalent to system_wq except that it's </span>
</li>
        <li class="alt">
                <span>* freezable. </span>
</li>
        <li class="alt">
                <span>* </span>
</li>
        <li class="alt">
                <span>* *_power_efficient_wq are inclined towards saving power and converted </span>
</li>
        <li class="alt">
                <span>* into WQ_UNBOUND variants if 'wq_power_efficient' is enabled; otherwise, </span>
</li>
        <li class="alt">
                <span>* they are same as their non-power-efficient counterparts - e.g. </span>
</li>
        <li class="alt">
                <span>* system_power_efficient_wq is identical to system_wq if </span>
</li>
        <li class="alt">
                <span>* 'wq_power_efficient' is disabled. See WQ_POWER_EFFICIENT for more info. </span>
</li>
        <li class="alt">
                <span>*/ </span>
</li>
        <li class="alt">
                <span>extern struct workqueue_struct *system_wq; </span>
</li>
        <li class="alt">
                <span>extern struct workqueue_struct *system_highpri_wq; </span>
</li>
        <li class="alt">
                <span>extern struct workqueue_struct *system_long_wq; </span>
</li>
        <li class="alt">
                <span>extern struct workqueue_struct *system_unbound_wq; </span>
</li>
        <li class="alt">
                <span>extern struct workqueue_struct *system_freezable_wq; </span>
</li>
        <li class="alt">
                <span>extern struct workqueue_struct *system_power_efficient_wq; </span>
</li>
        <li class="alt">
                <span>extern struct workqueue_struct *system_freezable_power_efficient_wq; </span>
</li>
</ol>
<p>
        以上这些默认工作队列的创建代码是(kernel/workqueue.c):</p>
<ol class="dp-xml">
<li class="alt">
                <span><span>int __init workqueue_init_early(void) </span></span>
</li>
        <li class="alt">
                <span>{ </span>
</li>
        <li class="alt">
                <span>... </span>
</li>
        <li class="alt">
                <span><span class="attribute">system_wq</span><span> = </span><span class="attribute-value">alloc_workqueue</span><span>("events", 0, 0); </span></span>
</li>
        <li class="alt">
                <span><span class="attribute">system_highpri_wq</span><span> = </span><span class="attribute-value">alloc_workqueue</span><span>("events_highpri", WQ_HIGHPRI, 0); </span></span>
</li>
        <li>
                <span><span class="attribute">system_long_wq</span><span> = </span><span class="attribute-value">alloc_workqueue</span><span>("events_long", 0, 0); </span></span>
</li>
        <li class="alt">
                <span><span class="attribute">system_unbound_wq</span><span> = </span><span class="attribute-value">alloc_workqueue</span><span>("events_unbound", WQ_UNBOUND, </span></span>
</li>
        <li class="alt">
                <span>WQ_UNBOUND_MAX_ACTIVE); </span>
</li>
        <li class="alt">
                <span><span class="attribute">system_freezable_wq</span><span> = </span><span class="attribute-value">alloc_workqueue</span><span>("events_freezable", </span></span>
</li>
        <li class="alt">
                <span>WQ_FREEZABLE, 0); </span>
</li>
        <li class="alt">
                <span><span class="attribute">system_power_efficient_wq</span><span> = </span><span class="attribute-value">alloc_workqueue</span><span>("events_power_efficient", </span></span>
</li>
        <li class="alt">
                <span>WQ_POWER_EFFICIENT, 0); </span>
</li>
        <li class="alt">
                <span><span class="attribute">system_freezable_power_efficient_wq</span><span> = </span><span class="attribute-value">alloc_workqueue</span><span>("events_freezable_power_efficient", </span></span>
</li>
        <li class="alt">
                <span>WQ_FREEZABLE | WQ_POWER_EFFICIENT, </span>
</li>
        <li class="alt">
                <span>0); </span>
</li>
        <li class="alt">
                <span>... </span>
</li>
        <li>
                <span>} </span>
</li>
</ol>
<p>
        此外,由于工作队列 system_wq 被使用的频率很高,于是内核就封装了一个简单的函数(schedule_work)给我们使用:</p>
<ol class="dp-xml">
<li class="alt">
                <span><span>/** </span></span>
</li>
        <li class="alt">
                <span>* schedule_work - put work task in global workqueue </span>
</li>
        <li class="alt">
                <span>* @work: job to be done </span>
</li>
        <li class="alt">
                <span>* </span>
</li>
        <li class="alt">
                <span>* Returns %false if @work was already on the kernel-global workqueue and </span>
</li>
        <li class="alt">
                <span>* %true otherwise. </span>
</li>
        <li class="alt">
                <span>* </span>
</li>
        <li class="alt">
                <span>* This puts a job in the kernel-global workqueue if it was not already </span>
</li>
        <li class="alt">
                <span>* queued and leaves it in the same position on the kernel-global </span>
</li>
        <li class="alt">
                <span>* workqueue otherwise. </span>
</li>
        <li>
                <span>*/ </span>
</li>
        <li class="alt">
                <span>static inline bool schedule_work(struct work_struct *work){ </span>
</li>
        <li class="alt">
                <span>return queue_work(system_wq, work); </span>
</li>
        <li class="alt">
                <span>} </span>
</li>
</ol>
<p>
        当然了,任何事情有利就有弊!</p>
<p>
        由于内核默认创建的工作队列,是被所有的驱动程序共享的。</p>
<p>
        如果所有的驱动程序都把等待处理的工作项委托给它们来处理,那么就会导致某个工作队列中过于拥挤。</p>
<p>
        根据先来后到的原则,工作队列中后加入的工作项,就可能因为前面工作项的处理函数执行的时间太长,从而导致时效性无法保证。</p>
<p>
        因此,这里存在一个系统平衡的问题。</p>
<p>
        关于工作队列的基本知识点就介绍到这里,下面来实际操作验证一下。</p>
<p class="maodian"></p><h2>
        驱动程序</h2>
<p>
        之前的几篇文章,在驱动程序中测试中断处理的操作流程都是一样的,因此这里就不在操作流程上进行赘述了。</p>
<p>
        这里直接给出驱动程序的全貌代码,然后查看 dmesg 的输出信息。</p>
<p>
        创建驱动程序源文件和 Makefile:</p>
<ol class="dp-xml">
<li class="alt">
                <span><span>$ cd tmp/linux-4.15/drivers </span></span>
</li>
        <li class="alt">
                <span>$ mkdir my_driver_interrupt_wq </span>
</li>
        <li class="alt">
                <span>$ touch my_driver_interrupt_wq.c </span>
</li>
        <li class="alt">
                <span>$ touch Makefile </span>
</li>
</ol>
<p>
        <strong>示例代码全貌</strong></p>
<p>
        测试场景是:加载驱动模块之后,如果监测到键盘上的ESC键被按下,那么就往内核默认的工作队列system_wq中增加一个工作项,然后观察该工作项对应的处理函数是否被调用。</p>
<ol class="dp-xml">
<li class="alt">
                <span><span>#include </span><span class="tag">&lt;</span><span class="tag-name">linux</span><span>/kernel.h</span><span class="tag">&gt;</span><span> </span></span>
</li>
        <li class="alt">
                <span>#include <span class="tag">&lt;</span><span class="tag-name">linux</span><span>/module.h</span><span class="tag">&gt;</span><span> </span></span>
</li>
        <li class="alt">
                <span>#include <span class="tag">&lt;</span><span class="tag-name">linux</span><span>/interrupt.h</span><span class="tag">&gt;</span><span> </span></span>
</li>
        <li class="alt">
                <span>static int irq; </span>
</li>
        <li>
                <span>static char * devname; </span>
</li>
        <li>
                <span>static struct work_struct mywork; </span>
</li>
        <li>
                <span>// 接收驱动模块加载时传入的参数 </span>
</li>
        <li class="alt">
                <span>module_param(irq, int, 0644); </span>
</li>
        <li class="alt">
                <span>module_param(devname, charp, 0644); </span>
</li>
        <li class="alt">
                <span>// 定义驱动程序的 ID,在中断处理函数中用来判断是否需要处理 </span>
</li>
        <li>
                <span>#define MY_DEV_ID 1226 </span>
</li>
        <li class="alt">
                <span>// 驱动程序数据结构 </span>
</li>
        <li class="alt">
                <span>struct myirq </span>
</li>
        <li class="alt">
                <span>{ </span>
</li>
        <li class="alt">
                <span>int devid; </span>
</li>
        <li class="alt">
                <span>}; </span>
</li>
        <li class="alt">
                <span>struct myirq <span class="attribute">mydev</span><span> ={ MY_DEV_ID }; </span></span>
</li>
        <li class="alt">
                <span>#define KBD_DATA_REG 0x60 </span>
</li>
        <li class="alt">
                <span>#define KBD_STATUS_REG 0x64 </span>
</li>
        <li class="alt">
                <span>#define KBD_SCANCODE_MASK 0x7f </span>
</li>
        <li class="alt">
                <span>#define KBD_STATUS_MASK 0x80 </span>
</li>
        <li class="alt">
                <span>// 工作项绑定的处理函数 </span>
</li>
        <li class="alt">
                <span>static void mywork_handler(struct work_struct *work) </span>
</li>
        <li class="alt">
                <span>{ </span>
</li>
        <li class="alt">
                <span>printk("mywork_handler is called. \n"); </span>
</li>
        <li class="alt">
                <span>// do some other things </span>
</li>
        <li class="alt">
                <span>} </span>
</li>
        <li class="alt">
                <span>//中断处理函数 </span>
</li>
        <li class="alt">
                <span>static irqreturn_t myirq_handler(int irq, void * dev) </span>
</li>
        <li class="alt">
                <span>{ </span>
</li>
        <li class="alt">
                <span>struct myirq mydev; </span>
</li>
        <li class="alt">
                <span>unsigned char key_code; </span>
</li>
        <li class="alt">
                <span><span class="attribute">mydev</span><span> = *(struct myirq*)dev; </span></span>
</li>
        <li>
                <span>// 检查设备 id,只有当相等的时候才需要处理 </span>
</li>
        <li class="alt">
                <span>if (<span class="attribute">MY_DEV_ID</span><span> == mydev.devid) </span></span>
</li>
        <li class="alt">
                <span>{ </span>
</li>
        <li class="alt">
                <span>// 读取键盘扫描码 </span>
</li>
        <li class="alt">
                <span><span class="attribute">key_code</span><span> = </span><span class="attribute-value">inb</span><span>(KBD_DATA_REG); </span></span>
</li>
        <li class="alt">
                <span>if (<span class="attribute">key_code</span><span> == 0x01) </span></span>
</li>
        <li class="alt">
                <span>{ </span>
</li>
        <li>
                <span>printk("ESC key is pressed! \n"); </span>
</li>
        <li>
                <span>// 初始化工作项 </span>
</li>
        <li class="alt">
                <span>INIT_WORK(&amp;mywork, mywork_handler); </span>
</li>
        <li>
                <span>// 加入到工作队列 system_wq </span>
</li>
        <li class="alt">
                <span>schedule_work(&amp;mywork); </span>
</li>
        <li class="alt">
                <span>} </span>
</li>
        <li class="alt">
                <span>} </span>
</li>
        <li class="alt">
                <span>return IRQ_HANDLED; </span>
</li>
        <li class="alt">
                <span>} </span>
</li>
        <li class="alt">
                <span>// 驱动模块初始化函数 </span>
</li>
        <li class="alt">
                <span>static int __init myirq_init(void) </span>
</li>
        <li class="alt">
                <span>{ </span>
</li>
        <li class="alt">
                <span>printk("myirq_init is called. \n"); </span>
</li>
        <li class="alt">
                <span>// 注册中断处理函数 </span>
</li>
        <li class="alt">
                <span>if(request_irq(irq, myirq_handler, IRQF_SHARED, devname, &amp;mydev)!=0) </span>
</li>
        <li class="alt">
                <span>{ </span>
</li>
        <li class="alt">
                <span>printk("register irq[%d] handler failed. \n", irq); </span>
</li>
        <li class="alt">
                <span>return -1; </span>
</li>
        <li class="alt">
                <span>} </span>
</li>
        <li class="alt">
                <span>printk("register irq[%d] handler success. \n", irq); </span>
</li>
        <li class="alt">
                <span>return 0; </span>
</li>
        <li class="alt">
                <span>} </span>
</li>
        <li class="alt">
                <span>// 驱动模块退出函数 </span>
</li>
        <li class="alt">
                <span>static void __exit myirq_exit(void) </span>
</li>
        <li class="alt">
                <span>{ </span>
</li>
        <li class="alt">
                <span>printk("myirq_exit is called. \n"); </span>
</li>
        <li class="alt">
                <span>// 释放中断处理函数 </span>
</li>
        <li class="alt">
                <span>free_irq(irq, &amp;mydev); </span>
</li>
        <li class="alt">
                <span>} </span>
</li>
        <li class="alt">
                <span>MODULE_LICENSE("GPL"); </span>
</li>
        <li class="alt">
                <span>module_init(myirq_init); </span>
</li>
        <li class="alt">
                <span>module_exit(myirq_exit); </span>
</li>
</ol>
<p>
        <strong>Makefile 文件</strong></p>
<ol class="dp-xml">
<li class="alt">
                <span><span>ifneq ($(KERNELRELEASE),) </span></span>
</li>
        <li class="alt">
                <span>obj-m <span class="attribute">:</span><span>= </span><span class="attribute-value">my_driver_interrupt_wq</span><span>.o </span></span>
</li>
        <li class="alt">
                <span>else </span>
</li>
        <li class="alt">
                <span>KERNELDIR ?= /lib/modules/$(shell uname -r)/build </span>
</li>
        <li class="alt">
                <span>PWD <span class="attribute">:</span><span>= $(shell pwd) </span></span>
</li>
        <li class="alt">
                <span>default: </span>
</li>
        <li class="alt">
                <span>$(MAKE) -C $(KERNELDIR) <span class="attribute">M</span><span>=$(PWD) modules </span></span>
</li>
        <li class="alt">
                <span>clean: </span>
</li>
        <li class="alt">
                <span>$(MAKE) -C $(KERNEL_PATH) <span class="attribute">M</span><span>=$(PWD) clean </span></span>
</li>
        <li class="alt">
                <span>endif </span>
</li>
</ol>
<p>
        <strong>编译、测试</strong></p>
<ol class="dp-xml">
<li class="alt">
                <span><span>$ make </span></span>
</li>
        <li class="alt">
                <span>$ sudo insmod my_driver_interrupt_wq.ko <span class="attribute">irq</span><span>=</span><span class="attribute-value">1</span><span> </span><span class="attribute">devname</span><span>=</span><span class="attribute-value">mydev</span><span> </span></span>
</li>
</ol>
<p>
        检查驱动模块是否加载成功:</p>
<ol class="dp-xml">
<li class="alt">
                <span><span>$ lsmod | grep my_driver_interrupt_wq </span></span>
</li>
        <li class="alt">
                <span>my_driver_interrupt_wq 16384 0 </span>
</li>
</ol>
<p>
        再看一下 dmesg 的输出信息:</p>
<ol class="dp-xml">
<li class="alt">
                <span><span>$ dmesg </span></span>
</li>
        <li class="alt">
                <span>... </span>
</li>
        <li class="alt">
                <span>[ 188.247636] myirq_init is called. </span>
</li>
        <li class="alt">
                <span>[ 188.247642] register irq handler success. </span>
</li>
</ol>
<p>
        说明:驱动程序的初始化函数 myirq_init 被调用了,并且成功注册了 1 号中断的处理程序。</p>
<p>
        此时,按一下键盘上的 ESC 键。</p>
<p>
        操作系统在捕获到键盘中断之后,会依次调用此中断的所有中断处理程序,其中就包括我们注册的 myirq_handler 函数。</p>
<p>
        在这个函数中,当判断出是ESC按键时,就初始化一个工作项(把结构体 work_struct 类型的变量与一个处理函数绑定起来),然后丢给操作系统预先创建好的工作队列(system_wq)去处理,如下所示:</p>
<ol class="dp-xml">
<li class="alt">
                <span><span>if (</span><span class="attribute">key_code</span><span> == 0x01) </span></span>
</li>
        <li class="alt">
                <span>{ </span>
</li>
        <li class="alt">
                <span>printk("ESC key is pressed! \n"); </span>
</li>
        <li class="alt">
                <span>INIT_WORK(&amp;mywork, mywork_handler); </span>
</li>
        <li class="alt">
                <span>schedule_work(&amp;mywork); </span>
</li>
        <li class="alt">
                <span>} </span>
</li>
</ol>
<p>
        因此,当相应的内核线程从这个工作队列(system_wq)中取出工作项(mywork)来处理的时候,函数 mywork_handler 就会被调用。</p>
<p>
        现在来看一下 dmesg 的输出信息:</p>
<ol class="dp-xml">
<li class="alt">
                <span><span>[ 305.053155] ESC key is pressed! </span></span>
</li>
        <li class="alt">
                <span>[ 305.053177] mywork_handler is called. </span>
</li>
</ol>
<p>
        可以看到:mywork_handler函数被正确调用了。</p>
<p>
        完美!</p>
<p>
        原文链接:https://mp.weixin.qq.com/s/WDPXOBBAni09UvgV_zxy0g</p>
頁: [1]
查看完整版本: Linux 工作队列 workqueue 是什么鬼?