07 A Deep Dive into Pod Scheduling

unlisted · suofeiya's blog

#kubernetes

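Preface #

The previous article, Kubernetes Tutorial Series (6): An Introduction to Kubernetes Resource Management and Quality of Service, covered resource scheduling and QoS in Kubernetes: how to define a pod's resources, how those resources affect scheduling, and which QoS class a pod receives once its resources are set. This article continues the series with the pod scheduling mechanism.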

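1. Pod Scheduling #

1.1 Pod Scheduling Overview #

Kubernetes is a container orchestration engine, and one of its core functions is scheduling containers. kube-scheduler schedules pods fully automatically. Scheduling a pod takes two phases: the scheduling cycle and the binding cycle. The scheduling cycle is further divided into filtering and scoring: following the configured scheduling policy, the scheduler selects the nodes that are able to run the pod and then ranks them. In the binding cycle, the pod is bound to the node that kube-scheduler selected; the kubelet on that node watches for the binding and runs the pod.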

Figure: Pod scheduling mechanism
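The two steps of the scheduling cycle can be sketched in a few lines of Python. This is only an illustration of the filter-then-score idea, not kube-scheduler's actual code: the `Node` and `Pod` classes, the resource figures, and the `leftover_score` rule are all assumptions made up for the example.

```python
from dataclasses import dataclass


@dataclass
class Node:
    name: str
    free_cpu_millis: int   # unallocated CPU, in millicores (made-up figures)
    free_memory_mib: int   # unallocated memory, in MiB (made-up figures)


@dataclass
class Pod:
    name: str
    cpu_millis: int        # requested CPU
    memory_mib: int        # requested memory


def fits_resources(pod, node):
    """Filter step: the node must have room for the pod's requests."""
    return (pod.cpu_millis <= node.free_cpu_millis
            and pod.memory_mib <= node.free_memory_mib)


def leftover_score(pod, node):
    """Scoring step (hypothetical rule): prefer the node left with the most free CPU."""
    return node.free_cpu_millis - pod.cpu_millis


def schedule(pod, nodes):
    """Return the name of the chosen node, or None if no node passes filtering."""
    feasible = [n for n in nodes if fits_resources(pod, n)]
    if not feasible:
        return None  # in a real cluster the pod would stay Pending
    best = max(feasible, key=lambda n: leftover_score(pod, n))
    return best.name


nodes = [
    Node("node-1", free_cpu_millis=100, free_memory_mib=256),
    Node("node-2", free_cpu_millis=500, free_memory_mib=1024),
    Node("node-3", free_cpu_millis=2000, free_memory_mib=4096),
]
chosen = schedule(Pod("nginx", cpu_millis=250, memory_mib=512), nodes)
print(chosen)  # node-3
```

Here node-1 is dropped by the filter because it lacks CPU, and node-3 wins the scoring step because it keeps the most CPU free after placement. The real scheduler combines many filters and many weighted scores in the same two-step shape.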

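The scheduling cycle consists of predicates (filtering) and scoring (ranking). Predicates select the nodes that satisfy the pod's requirements; scoring rates those nodes and ranks them so that the best fit comes first. In Kubernetes releases of this era (the cluster in this article runs v1.15.3), predicates include checks such as PodFitsResources, PodFitsHostPorts, PodMatchNodeSelector, and PodToleratesNodeTaints.

Predicates check conditions on the node. A node's current conditions can be viewed with kubectl describe node node-id, as shown in the figure:

Figure: Node scheduling conditions

Scoring (priority) algorithms of the same era include LeastRequestedPriority, BalancedResourceAllocation, NodeAffinityPriority, and ImageLocalityPriority, among others.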

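1.2 Scheduling with nodeName #

nodeName is a field of PodSpec; setting pod.spec.nodeName places the pod on one specific node. The field is special and is normally left empty. When nodeName is set, kube-scheduler skips the pod entirely and the kubelet on the named node starts it directly. Scheduling by nodeName bypasses the cluster's intelligent scheduling, so resource usage can become uneven across nodes. Giving such pods the Guaranteed QoS class is recommended, so that they are not evicted when resources run short. The following example creates a pod that runs on node-3: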

  1. Write a YAML manifest that pins the pod to node-3
[root@node-1 demo]# cat nginx-nodeName.yaml
apiVersion: v1
kind: Pod
metadata:
  name: nginx-run-on-nodename
  annotations:
    kubernetes.io/description: "Running the Pod on specific nodeName"
spec:
  containers:
  - name: nginx-run-on-nodename
    image: nginx:latest
    ports:
    - name: http-80-port
      protocol: TCP
      containerPort: 80
  nodeName: node-3       # nodeName pins nginx-run-on-nodename to node-3
  2. Apply the manifest
[root@node-1 demo]# kubectl apply -f nginx-nodeName.yaml
pod/nginx-run-on-nodename created
  3. Check the pod; it is running on node-3
[root@node-1 demo]# kubectl get pods nginx-run-on-nodename -o wide
NAME                    READY   STATUS    RESTARTS   AGE     IP            NODE     NOMINATED NODE   READINESS GATES
nginx-run-on-nodename   1/1     Running   0          6m52s   10.244.2.15   node-3   <none>           <none>
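Why the scheduler never touches this pod can be shown with a tiny Python sketch. It assumes pods are represented as plain dictionaries shaped like their manifests; the helper `needs_scheduling` and the second pod, `nginx-unassigned`, are hypothetical and exist only for the illustration.

```python
def needs_scheduling(pod):
    """A pod is left for the scheduler only while spec.nodeName is empty."""
    return not pod.get("spec", {}).get("nodeName")


pods = [
    # The pod from the example above: nodeName is already set.
    {"metadata": {"name": "nginx-run-on-nodename"}, "spec": {"nodeName": "node-3"}},
    # A hypothetical pod without nodeName: the scheduler has to place it.
    {"metadata": {"name": "nginx-unassigned"}, "spec": {}},
]

pending = [p["metadata"]["name"] for p in pods if needs_scheduling(p)]
print(pending)  # ['nginx-unassigned']
```

Only the pod without nodeName is left for scheduling; the pinned pod goes straight to the kubelet on node-3, which is also why no filtering or resource check protects it from landing on an overloaded node.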

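1.3 Scheduling with nodeSelector #

nodeSelector is a field of PodSpec and the simplest way to run a pod on particular nodes. It works through key-value pairs: label the node first, then list the same labels in the pod's nodeSelector so that the scheduler only considers matching nodes. The example below adds the label app=web to node-2 and selects that label through nodeSelector when scheduling the pod: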
  1. Add a label to node-2
[root@node-1 demo]# kubectl label node node-2 app=web
node/node-2 labeled
  2. Verify the labels; node-2 now carries the additional label app=web
[root@node-1 demo]# kubectl get nodes --show-labels
NAME     STATUS   ROLES    AGE   VERSION   LABELS
node-1   Ready    master   15d   v1.15.3   beta.kubernetes.io/arch=amd64,beta.kubernetes.io/os=linux,kubernetes.io/arch=amd64,kubernetes.io/hostname=node-1,kubernetes.io/os=linux,node-role.kubernetes.io/master=
node-2   Ready    <none>   15d   v1.15.3   app=web,beta.kubernetes.io/arch=amd64,beta.kubernetes.io/os=linux,kubernetes.io/arch=amd64,kubernetes.io/hostname=node-2,kubernetes.io/os=linux
node-3   Ready    <none>   15d   v1.15.3   beta.kubernetes.io/arch=amd64,beta.kubernetes.io/os=linux,kubernetes.io/arch=amd64,kubernetes.io/hostname=node-3,kubernetes.io/os=linux
  3. Use nodeSelector to schedule the pod onto nodes labeled app=web
[root@node-1 demo]# cat nginx-nodeselector.yaml
apiVersion: v1
kind: Pod
metadata:
  name: nginx-run-on-nodeselector
  annotations:
    kubernetes.io/description: "Running the Pod on specific node by nodeSelector"
spec:
  containers:
  - name: nginx-run-on-nodeselector
    image: nginx:latest
    ports:
    - name: http-80-port
      protocol: TCP
      containerPort: 80
  nodeSelector:     # nodeSelector schedules the pod onto nodes carrying these labels
    app: web
  4. Apply the YAML file to create the pod
[root@node-1 demo]# kubectl apply -f nginx-nodeselector.yaml
pod/nginx-run-on-nodeselector created
  5. Check the pod; it is running on node-2
[root@node-1 demo]# kubectl get pods nginx-run-on-nodeselector -o wide
NAME                        READY   STATUS    RESTARTS   AGE   IP            NODE     NOMINATED NODE   READINESS GATES
nginx-run-on-nodeselector   1/1     Running   0          51s   10.244.1.24   node-2   <none>           <none>

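Kubernetes predefines a number of built-in labels that describe node attributes, such as the architecture (kubernetes.io/arch), the operating system (kubernetes.io/os), and the hostname (kubernetes.io/hostname).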
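The matching rule behind nodeSelector is simple enough to sketch in Python: a node qualifies only if it carries every key-value pair listed in the selector. The snippet below is an illustration, not scheduler code; the `NODES` dictionary is a trimmed copy of the labels shown by kubectl above, and the function names are hypothetical.

```python
# Labels taken from the `kubectl get nodes --show-labels` output above (trimmed).
NODES = {
    "node-1": {"kubernetes.io/hostname": "node-1", "kubernetes.io/os": "linux",
               "node-role.kubernetes.io/master": ""},
    "node-2": {"app": "web", "kubernetes.io/hostname": "node-2",
               "kubernetes.io/os": "linux"},
    "node-3": {"kubernetes.io/hostname": "node-3", "kubernetes.io/os": "linux"},
}


def matches_node_selector(node_selector, node_labels):
    """True if the node carries every key-value pair in the selector."""
    return all(node_labels.get(key) == value
               for key, value in node_selector.items())


def candidate_nodes(node_selector, nodes):
    """Names of all nodes whose labels satisfy the selector."""
    return [name for name, labels in nodes.items()
            if matches_node_selector(node_selector, labels)]


print(candidate_nodes({"app": "web"}, NODES))                        # ['node-2']
print(candidate_nodes({"kubernetes.io/hostname": "node-3"}, NODES))  # ['node-3']
```

The second call shows that built-in labels work the same way as custom ones, so a pod can target a node by hostname without labeling it first. A selector with several pairs narrows the candidates further, because all pairs must match.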

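1.4 Node Affinity and Anti-affinity #

Affinity and anti-affinity serve a similar purpose to nodeSelector but are more expressive. The Kubernetes documentation of the time expected affinity to eventually supersede nodeSelector. Compared with nodeSelector, affinity adds a richer matching language (operators such as In, NotIn, and Exists instead of exact key-value equality), soft preferences in addition to hard requirements, and rules that match the labels of pods already running on a node rather than the node's own labels.

The example below demonstrates node affinity. requiredDuringSchedulingIgnoredDuringExecution specifies conditions that a node must satisfy, and preferredDuringSchedulingIgnoredDuringExecution specifies conditions that are preferred. The two work together: required rules narrow the set of candidate nodes, and preferred rules rank the nodes within that set.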

  1. List the node labels; each node carries several labels by default, such as kubernetes.io/hostname
[root@node-1 ~]# kubectl get nodes --show-labels
NAME     STATUS   ROLES    AGE   VERSION   LABELS
node-1   Ready    master   15d   v1.15.3   beta.kubernetes.io/arch=amd64,beta.kubernetes.io/os=linux,kubernetes.io/arch=amd64,kubernetes.io/hostname=node-1,kubernetes.io/os=linux,node-role.kubernetes.io/master=
node-2   Ready    <none>   15d   v1.15.3   app=web,beta.kubernetes.io/arch=amd64,beta.kubernetes.io/os=linux,kubernetes.io/arch=amd64,kubernetes.io/hostname=node-2,kubernetes.io/os=linux
node-3   Ready    <none>   15d   v1.15.3   beta.kubernetes.io/arch=amd64,beta.kubernetes.io/os=linux,kubernetes.io/arch=amd64,kubernetes.io/hostname=node-3,kubernetes.io/os=linux
  2. Schedule with node affinity: requiredDuringSchedulingIgnoredDuringExecution requires kubernetes.io/hostname to be one of node-1, node-2, or node-3, and preferredDuringSchedulingIgnoredDuringExecution prefers nodes labeled app=web
[root@node-1 demo]# cat nginx-node-affinity.yaml
apiVersion: v1
kind: Pod
metadata:
  name: nginx-run-node-affinity
  annotations:
    kubernetes.io/description: "Running the Pod on specific node by node affinity"
spec:
  containers:
  - name: nginx-run-node-affinity
    image: nginx:latest
    ports:
    - name: http-80-port
      protocol: TCP
      containerPort: 80
  affinity:
    nodeAffinity:
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
        - matchExpressions:
          - key: kubernetes.io/hostname
            operator: In
            values:
            - node-1
            - node-2
            - node-3
      preferredDuringSchedulingIgnoredDuringExecution:
      - weight: 1
        preference:
          matchExpressions:
          - key: app
            operator: In
            values: ["web"]
  3. Apply the YAML file to create the pod
[root@node-1 demo]# kubectl apply -f nginx-node-affinity.yaml
pod/nginx-run-node-affinity created
  4. Confirm which node the pod landed on; node-2 is the node that satisfies both the required and the preferred conditions
[root@node-1 demo]# kubectl get pods --show-labels nginx-run-node-affinity -o wide
NAME                      READY   STATUS    RESTARTS   AGE    IP            NODE     NOMINATED NODE   READINESS GATES   LABELS
nginx-run-node-affinity   1/1     Running   0          106s   10.244.1.25   node-2   <none>           <none>            <none>
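How the required and preferred rules interact can be sketched in Python. This is a simplified model under stated assumptions, not the scheduler's implementation: it handles only matchExpressions with the In, NotIn, Exists, and DoesNotExist operators, it ignores taints (such as the one normally present on the master node-1) and every other scoring rule, and the function names and the trimmed `NODES` labels are chosen for the illustration.

```python
NODES = {
    "node-1": {"kubernetes.io/hostname": "node-1"},
    "node-2": {"kubernetes.io/hostname": "node-2", "app": "web"},
    "node-3": {"kubernetes.io/hostname": "node-3"},
}

# The nodeAffinity stanza from nginx-node-affinity.yaml, as a Python dict.
AFFINITY = {
    "requiredDuringSchedulingIgnoredDuringExecution": {
        "nodeSelectorTerms": [{"matchExpressions": [
            {"key": "kubernetes.io/hostname", "operator": "In",
             "values": ["node-1", "node-2", "node-3"]},
        ]}],
    },
    "preferredDuringSchedulingIgnoredDuringExecution": [
        {"weight": 1, "preference": {"matchExpressions": [
            {"key": "app", "operator": "In", "values": ["web"]},
        ]}},
    ],
}


def expr_matches(expr, labels):
    """Evaluate one matchExpressions entry against a node's labels."""
    key, op, values = expr["key"], expr["operator"], expr.get("values", [])
    if op == "In":
        return key in labels and labels[key] in values
    if op == "NotIn":
        return key not in labels or labels[key] not in values
    if op == "Exists":
        return key in labels
    if op == "DoesNotExist":
        return key not in labels
    raise ValueError(f"operator not covered by this sketch: {op}")


def term_matches(term, labels):
    """All expressions inside one term must match (logical AND)."""
    exprs = term.get("matchExpressions", [])
    return bool(exprs) and all(expr_matches(e, labels) for e in exprs)


def pick_node(affinity, nodes):
    """Filter by the required terms (ORed), then rank by summed preferred weights."""
    required = affinity.get("requiredDuringSchedulingIgnoredDuringExecution", {})
    terms = required.get("nodeSelectorTerms", [])
    feasible = [name for name, labels in nodes.items()
                if not terms or any(term_matches(t, labels) for t in terms)]
    if not feasible:
        return None
    preferred = affinity.get("preferredDuringSchedulingIgnoredDuringExecution", [])

    def score(name):
        return sum(p["weight"] for p in preferred
                   if term_matches(p["preference"], nodes[name]))

    return max(feasible, key=score)


print(pick_node(AFFINITY, NODES))  # node-2
```

All three nodes pass the required rule, and only node-2 earns the preferred weight, so it ranks first, which agrees with the kubectl output above. If no node carried app=web, the pod would still be scheduled on one of the three nodes, because a preferred rule never blocks placement.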

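Closing Remarks #

This article introduced the scheduling mechanism in Kubernetes. By default, pods are scheduled fully automatically by kube-scheduler, in two phases: the scheduling phase (filtering and scoring) and the binding phase (running the pod on the chosen node). There are four ways to influence where a pod is placed:

  1. Specify nodeName
  2. Use nodeSelector
  3. Use node affinity and anti-affinity
  4. Use pod affinity and anti-affinity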

Appendix #

Scheduling framework: https://kubernetes.io/docs/concepts/configuration/scheduling-framework/

Assigning pods to nodes: https://kubernetes.io/docs/concepts/configuration/assign-pod-node/

[Reposted] This article was collected from the internet and will be removed on request in case of infringement.