03 - DFA (Application)
Basic Concepts
Data flows on CFG
data:image/s3,"s3://crabby-images/b7d37/b7d37ac21135cf2a3f7c26221b79f0298a06d54e" alt=""
In CFG, the application-specific Data(Abstraction) will flows through the Nodes(BBs/Statement), Edges(control flow), CFG(a program).
Most static analyses are may analysis:
May Analysis:
Output maybe true. (over-approximation) – 近似值也可以
Must Analysis:
Output must true. (Under-approximation) – 不想要近似值
Preliminaries of DFA
DFA(here): Data Flow Analysis
data:image/s3,"s3://crabby-images/aac57/aac57aa558735b6c1a96bcd10b0957dd23654f27" alt=""
Each red point represents as a program states.
Each green filed for data-flow values.
domain: 不同的 DFA 有不同的 domain 值,domain 包含了所有该应用可能出现的值
domain values 有可能是 variables, expressions …
Forwards and Backwards
data:image/s3,"s3://crabby-images/3dd05/3dd0596d6bd5ce9bf018c59540d301da11a631e0" alt=""
- Forward: yellow part.
- Backward: red part.
- Meet: BBs 间的运算方式
Understand the three data flow analyses
Reaching Definitions
data:image/s3,"s3://crabby-images/718f3/718f3aeb6056633275c1b5a9b617d9707ef31f3a" alt=""
- Generates a definition and kills all the other definitions including variable v
- Forwards direction
- Domain: definitions
- Meet: $\bigcup$
data:image/s3,"s3://crabby-images/3e902/3e90288b3b8c0abdbe7f5cf274339a9587b7de17" alt=""
- Initialization: For every BBs(including entry) initialized as NULL set.
- Stop: any OUT sets wouldn’t change.
An example:
data:image/s3,"s3://crabby-images/77885/778858fcc2d76e77b7395bf06c88f31605ae212c" alt=""
data:image/s3,"s3://crabby-images/a7947/a794716c814d038e842d7cb3b85d67812f395d86" alt=""
需要注意:
- Meet 操作是 union 还是 intersection
- IN[s] 的 s 是否有几个 BBs,看箭头数量即可
- IN 不变,OUT 也不变,存在 monotonicity(单调性),可以到达 fixed point.
Live Variables
data:image/s3,"s3://crabby-images/03675/03675d3df714d9425cf59e7aeceed94726b6ce59" alt=""
- Domain:Variables
- OUT:后续已知 IN 的并集
- $use_B$ :表示在 B 中使用了未被 redefine 的 vars
- $def_B$:表示 vars 在 B 中被重定义了,$IN[B]$ 的对应 var 值为 dead value,需要被 kill 掉
data:image/s3,"s3://crabby-images/be152/be152892ccdc596fc53b976ea76a2a63505ffa6d" alt=""
注意:
Initialization: for each BBs including exit initialed as IN[Block] = $\varnothing$
Meet: $\bigcup$
Direction: Backwards
An example:
data:image/s3,"s3://crabby-images/2e013/2e013f3e03f72d0fce6a25b7e0cb02472729a6d1" alt=""
data:image/s3,"s3://crabby-images/4cafb/4cafb933947fdd8cf1c1009eadb2ff4bb6a2e415" alt=""
Tips:
- 最好把 IN 和 OUT 都表示出来,这样不容易混
- Domain 是 variables,容易与 Reaching Definitions 的 definitions 混淆
- 应用:寄存器值替换,如果一个值是 dead value,可以清空值所在的寄存器,节省空间
Available Expressions
data:image/s3,"s3://crabby-images/4e210/4e21049a82104b2fa5a27f0b764918e41dbc4628" alt=""
Meet: $\bigcap$
碎碎念: 如果是 $\bigcup$ 则结果为 may,如果是 $\bigcap$ 则结果为 must
Domain: expression
Define 是每个 block 内的定义式,expression 有可能出现在不同的 block 内
data:image/s3,"s3://crabby-images/c67a3/c67a3bd14c1d501940153394286a322dcf6d09cb" alt=""
OUT[entry] = $\varnothing$
OUT[other blocks] = $\bigcup$ –>
11111...111
如果不全部初始化为 1,之后取交集会陷入全 0 的死循环
An example:
data:image/s3,"s3://crabby-images/9b9e1/9b9e16ae4768d5dd9c5f78ae2400c181c4e256ea" alt=""
data:image/s3,"s3://crabby-images/29bf6/29bf698441ef977713539989732e9c0312651a40" alt=""
碎碎念: 注意 Available expression 的定义,转移过程中仅关注表达式的值是否还能被再次利用,如果不能(不能说明值已更新),则进行 expression 的替换
Differences and Similarities of the three data flow analyses
Fill the blank:
data:image/s3,"s3://crabby-images/ac52a/ac52a94d6c04c4b64d3e8597e36b939393b43c75" alt=""
Answer:
data:image/s3,"s3://crabby-images/76c85/76c85c9e0de74a8c00e282e137c5f822c40575ca" alt=""
- Boundary: 对 Exit 和 Entry 的限制
- Initialization: DFA 中对 BBs 的初始化,跟 Direction 和 Meet 相关
Interative Alg. and why it is able to terminate
My Answer:
Because when INs are not change, the OUTs will not change, after iterations, the INs and OUTs will reach a fixed point.
Right Answer:
data:image/s3,"s3://crabby-images/4eb3a/4eb3aa2234fd12bf344de7b54cc6670a725a4d5d" alt=""