频繁创建和销毁北极星SDK实例的时候导致CPU的使用持续飙升及内存泄漏风险 #1370

pemako · 2024-07-16T08:34:48Z

问题描述：
场景是在大量的定时任务脚本中会初始化北极星实例，定时任务一般是在分钟级别完成。一天大概会有20W的量级，服务发布3天发现 CPU再持续增长。

分析：
通过 pprof 查看发现大部分时间耗费 healthcheck 中，对应的代码为值 https://github.com/polarismesh/polaris/blob/v1.17.5/service/healthcheck/check.go#L147

避免对 instance 的遍历会降低 CPU 的使用

func (c *CheckScheduler) processAdoptEvents(
	instances map[string]bool, add bool, checker plugin.HealthChecker) map[string]bool {
	instanceIds := make([]string, 0, len(instances))
        //  注释掉下面三行取消遍历会降低 CPU 的使用
	//for id := range instances {
	//	instanceIds = append(instanceIds, id)
	//}
	log.Debug("[Health Check][Check] adopt event", zap.Any("instances", instanceIds),
		zap.String("server", server.localHost), zap.Bool("add", add))
	return instances
}

在 doAdopt 函数中 instancesToRemove 这个 map 的 size 会一直增大，还是存在内存泄漏的问题

func (c *CheckScheduler) doAdopt(ctx context.Context) {
	instancesToAdd := make(map[string]bool)
	instancesToRemove := make(map[string]bool)
	var checker plugin.HealthChecker
	ticker := time.NewTicker(batchAdoptInterval)
        ....
}

使用到的版本是 1.17.5

The text was updated successfully, but these errors were encountered:

shichaoyuan · 2024-07-26T07:22:28Z

实例没有deregister？

chuntaojun · 2024-07-30T05:43:50Z

改代码最新的 main 分支中已经移除该逻辑

pemako added the bug Something isn't working label Jul 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

频繁创建和销毁北极星SDK实例的时候导致CPU的使用持续飙升及内存泄漏风险 #1370

频繁创建和销毁北极星SDK实例的时候导致CPU的使用持续飙升及内存泄漏风险 #1370

pemako commented Jul 16, 2024

shichaoyuan commented Jul 26, 2024

chuntaojun commented Jul 30, 2024

频繁创建和销毁北极星SDK实例的时候导致CPU的使用持续飙升及内存泄漏风险 #1370

频繁创建和销毁北极星SDK实例的时候导致CPU的使用持续飙升及内存泄漏风险 #1370

Comments

pemako commented Jul 16, 2024

shichaoyuan commented Jul 26, 2024

chuntaojun commented Jul 30, 2024