LinMao's Blog
学习科研记录与分享!

Common data races in Go

I am working with Milind to fix the data race in Uber's Go database. Here is the common data races concluded from his paper: "A Study of Real-World Data Races in Golang".

Pattern 1: Loop variable capture by reference.

In Go, the loop iterator variable in for range is a single variable that takes different values in each loop iteration. (Loop variable capture by reference.)

It's better to make a local copy when using goroutines to process the loop variables.

Pattern 2: Error variable capture

Mixed use of error variable. In the following example, the concurrent writes on the return error variable from foo2() and foo3() cause a data race.

Pattern 3: Normal return in a function with a named return

In Go, there is a feature called "named result parameters". The return variable can be given names and used as regular variables, just like the incoming parameters. When named, they are initialized to the zero values for their types when the function begins; if the function executes a return statement without arguments, the current values of the result parameters are used as the returned values.

 

Pattern 4: Deferred functions in a named return

Go offers a keyword defer to schedule a function call (the deferred function) to be run immediately before the function executing the defer returns.

Pattern 5: Meta fields of Slice change

The append operation appends the elements to the slice's end and returns the result. The slice may change as the number of elements reaches the slice's length, which returns a new slice.

Pattern 6: Data Races on Thread-Unsafe Map

In Go, a map (hash table) is a sparse data structure, and accessing one element might result in accessing another element; if during the same process another insertion/deletion happens, it will modify the sparse data structure and cause a data race.

Pattern 7: Incorrect use of synchronization constructs

Incorrect use of synchronization constructs such as sync.Mutex and sync.RWMutex, which are value types (structures) in Go, often causes data races.

In the following example, the mutex passed to the two goroutines is captured by value. The two goroutines possess different mutex objects sharing no internal state. A correct implementation should pass the address of mutex (&mutex).

Pattern 8: Mixing Shared Memory with Message Passing

Pattern 9: Incorrect Use of Flexible Group Synchronization

Go offers more leeway in its group synchronization construct sync.WaitGroup. The number of participants is dynamic. Incorrect placement of Add and Done methods of a sync.WaitGroup leads to data races.

In the following example, the incorrect placement of wg.Add(1) fails to guarantee the racyAccess is accessed before reaching wg.Wait().

Also, there is an example of incorrect placement of wg.Done().

Pattern 10: Mutating Shared Data in a Reader-Lock-Protected Critical Section

Go offers a reader-writer lock RWMutex, which allows concurrent readers to execute a critical section simultaneously. The RLock/RUnlock methods on an RWMutex hold the lock in a read-only mode. Sometimes developers accidentally put statements that may modify shared data in critical sections protected by RWMutex, while using RLock/RUnlock methods.

 

Reference:

Chabbi, Milind, and Murali Krishna Ramanathan. "A study of real-world data races in Golang." Proceedings of the 43rd ACM SIGPLAN International Conference on Programming Language Design and Implementation. 2022.

赞(3) 打赏
转载请注明出处:LinMao's Blog(林茂的博客) » Common data races in Go

评论 抢沙发

  • 昵称 (必填)
  • 邮箱 (必填)
  • 网址

LinMao's Blog(林茂的博客)

了解更多联系我们

觉得文章有用就打赏一下作者吧~

支付宝扫一扫打赏

微信扫一扫打赏