Merge from tree-cleanup-branch: VRP, store CCP, store copy-prop, incremental SSA updating of FUD chains and newly exposed symbols. * Makefile.in (tree-ssa-copy.o): Depend on tree-ssa-propagate.h. (OBJS-common): Add tree-vrp.o. (tree-vrp.o): New rule. * basic-block.h (nearest_common_dominator_for_set): Declare. * common.opt (ftree-store-ccp): New flag. (ftree-copy-prop): New flag. (ftree-vrp): New flag. (ftree-store-copy-prop): New flag. * dominance.c (nearest_common_dominator_for_set): New. * domwalk.c (walk_dominator_tree): Only traverse statements in blocks marked in walk_data->interesting_blocks. * domwalk.h (struct dom_walk_data): Add field interesting_blocks. * fold-const.c (fold): Handle ASSERT_EXPR. * opts.c (decode_options): Set flag_tree_copy_prop at -O1. Set flag_tree_store_ccp, flag_tree_store_copy_prop and flag_tree_vrp at -O2. * timevar.def (TV_TREE_VRP): Define. (TV_TREE_COPY_PROP): Define. (TV_TREE_STORE_COPY_PROP): Define. (TV_TREE_SSA_INCREMENTAL): Define. (TV_TREE_STORE_CCP): Define. * tree-cfg.c (tree_can_merge_blocks_p): Remove reference to kill_redundant_phi_nodes from comment. (verify_expr): Handle ASSERT_EXPR. * tree-dfa.c (mark_new_vars_to_rename): Remove second argument. Update all users. (mark_call_clobbered_vars_to_rename): Remove. Update all users. * tree-flow-inline.h (unmodifiable_var_p): New. * tree-flow.h (enum value_range_type): Declare. (struct value_range_def): Declare. (value_range): Declare. (remove_all_phi_nodes_for): Remove. Update all users. (find_phi_node_for): Declare. (add_type_alias): Declare. (count_uses_and_derefs): Declare. (kill_redundant_phi_nodes): Remove. (rewrite_into_ssa): Remove. (rewrite_def_def_chains): Remove. (update_ssa, register_new_name_mapping, create_new_def_for, need_ssa_update_p, name_registered_for_update_p, release_ssa_name_after_update_ssa, dump_repl_tbl, debug_repl_tbl, dump_names_replaced_by, debug_names_replaced_by, mark_sym_for_renaming, mark_set_for_renaming, get_current_def, set_current_def, get_value_range, dump_value_range, debug_value_range, dump_all_value_ranges, debug_all_value_ranges, expr_computes_nonzero, loop_depth_of_name, unmodifiable_var_p): Declare. * tree-gimple.c (is_gimple_formal_tmp_rhs): Handle ASSERT_EXPR. * tree-into-ssa.c (block_defs_stack): Update comment. (old_ssa_names, new_ssa_names, old_virtual_ssa_names, syms_to_rename, names_to_release, repl_tbl, need_to_initialize_update_ssa_p, need_to_update_vops_p, need_to_replace_names_p): New locals. (NAME_SETS_GROWTH_FACTOR): Define. (struct repl_map_d): Declare. (struct mark_def_sites_global_data): Add field interesting_blocks. (enum rewrite_mode): Declare. (REGISTER_DEFS_IN_THIS_STMT): Define. (compute_global_livein): Use last_basic_block instead of n_basic_blocks. (set_def_block): Remove last argument. Update all callers. (prepare_use_operand_for_rename): Remove. Update all callers. (prepare_def_operand_for_rename): Remove. Update all callers. (symbol_marked_for_renaming): New. (is_old_name): New. (is_new_name): New. (repl_map_hash): New. (repl_map_eq): New. (repl_map_free): New. (names_replaced_by): New. (add_to_repl_tbl): New. (add_new_name_mapping): New. (mark_def_sites): Assume that all the operands in the statement are in normal form. (find_idf): Assert that the block in the stack is valid. (get_default_def_for): New. (insert_phi_nodes_for): Add new argument 'update_p'. Add documentation. If update_p is true, add a new mapping between the LHS of each new PHI and the name that it replaces. (insert_phi_nodes_1): Only call find_idf if needed. (get_reaching_def): Call get_default_def_for. (rewrite_operand): Remove. (rewrite_stmt): Do nothing if REGISTER_DEFS_IN_THIS_STMT and REWRITE_THIS_STMT are false. Assume that all the operands in the statement are in normal form. (rewrite_add_phi_arguments): Don't use PHI_REWRITTEN. (rewrite_virtual_phi_arguments): Remove. (invalidate_name_tags): Remove. (register_new_update_single, register_new_update_set, rewrite_update_init_block, replace_use, rewrite_update_fini_block, rewrite_update_stmt, rewrite_update_phi_arguments): New. rewrite_blocks): Remove argument 'fix_virtual_phis'. Add arguments 'entry', 'what' and 'blocks'. Initialize the dominator walker according to 'what' and 'blocks'. Start the dominator walk at 'entry'. (mark_def_site_blocks): Add argument 'interesting_blocks'. Use it to configure the dominator walker. (rewrite_into_ssa): Remove argument 'all'. Make internal. (rewrite_all_into_ssa): Remove. (rewrite_def_def_chains): Remove. (mark_def_interesting, mark_use_interesting, prepare_phi_args_for_update, prepare_block_for_update, prepare_def_site_for, prepare_def_sites, dump_names_replaced_by, debug_names_replaced_by, dump_repl_tbl, debug_repl_tbl, init_update_ssa, delete_update_ssa, create_new_def_for, register_new_name_mapping, mark_sym_for_renaming, mark_set_for_renaming, need_ssa_update_p, name_registered_for_update_p, ssa_names_to_replace, release_ssa_name_after_update_ssa, insert_updated_phi_nodes_for, update_ssa): New. * tree-loop-linear.c (linear_transform_loops): Call update_ssa instead of rewrite_into_ssa. * tree-optimize.c (vars_to_rename): Remove. Update all users. (init_tree_optimization_passes): Replace pass_redundant_phi with pass_copy_prop. Add pass_vrp. Replace pass_ccp with pass_store_ccp. Add pass_store_copy_prop after pass_store_ccp. (execute_todo): If the TODO_ flags don't include updating the SSA form, assert that it does not need to be updated. Call update_ssa instead of rewrite_into_ssa and rewrite_def_def_chains. If TODO_verify_loops is set, call verify_loop_closed_ssa. (tree_rest_of_compilation): * tree-pass.h (TODO_dump_func, TODO_ggc_collect, TODO_verify_ssa, TODO_verify_flow, TODO_verify_stmts, TODO_cleanup_cfg): Renumber. (TODO_verify_loops, TODO_update_ssa, TODO_update_ssa_no_phi, TODO_update_ssa_full_phi, TODO_update_ssa_only_virtuals): Define. (pass_copy_prop, pass_store_ccp, pass_store_copy_prop, pass_vrp): Declare. * tree-phinodes.c (make_phi_node): Update documentation. (remove_all_phi_nodes_for): Remove. (find_phi_node_for): New. * tree-pretty-print.c (dump_generic_node): Handle ASSERT_EXPR. * tree-scalar-evolution.c (follow_ssa_edge_in_rhs): Likewise. (interpret_rhs_modify_expr): Likewise. * tree-sra.c (decide_instantiations): Mark all symbols in SRA_CANDIDATES for renaming. (mark_all_v_defs_1): Rename from mark_all_v_defs. (mark_all_v_defs): New function. Update all users to call it with the whole list of scalarized statements, not just the first one. * tree-ssa-alias.c (count_ptr_derefs): Make extern. (compute_flow_insensitive_aliasing): If the tag is unmodifiable and the variable isn't or vice-versa, don't make them alias of each other. (setup_pointers_and_addressables): If the type tag for VAR is about to change, mark the old one for renaming. (add_type_alias): New. * tree-ssa-ccp.c: Document SSA-CCP and STORE-CCP. (ccp_lattice_t): Rename from latticevalue. (value): Remove. Update all users. (const_val): New local variable. (do_store_ccp): New local variable. (dump_lattice_value): Handle UNINITIALIZED. (debug_lattice_value): New. (get_default_value): Re-write. (set_lattice_value): Re-write. (def_to_varying): Remove. Update all users. (likely_value): Return VARYING for statements that make stores when STORE_CCP is false. Return VARYING for any statement other than MODIFY_EXPR, COND_EXPR and SWITCH_EXPR. (ccp_initialize): Re-write. (replace_uses_in, replace_vuse_in, substitute_and_fold): Move to tree-ssa-propagate.c. (ccp_lattice_meet): Handle memory stores when DO_STORE_CCP is true. (ccp_visit_phi_node): Likewise. (ccp_fold): Likewise. (evaluate_stmt): Likewise. (visit_assignment): Likewise. (ccp_visit_stmt): Likewise. (execute_ssa_ccp): Add argument 'store_ccp'. Copy it into DO_STORE_CCP. (do_ssa_ccp): New. (pass_ccp): Use it. (do_ssa_store_ccp): New. (gate_store_ccp): New. (pass_store_ccp): Declare. * tree-ssa-copy.c: Include tree-ssa-propagate.h. (may_propagate_copy): Reformat. Don't abort if ORIG is a virtual and DEST isn't. If NEW does not have alias information but DEST does, copy it. (copy_of, cached_last_copy_of, do_store_copy_prop, enum copy_prop_kind, which_copy_prop): Declare. (stmt_may_generate_copy, get_copy_of_val, get_last_copy_of, set_copy_of_val, dump_copy_of, copy_prop_visit_assignment, copy_prop_visit_cond_stmt, copy_prop_visit_stmt, copy_prop_visit_phi_node, init_copy_prop, fini_copy_prop, execute_copy_prop, gate_copy_prop, do_copy_prop, gate_store_copy_prop, store_copy_prop): New. (pass_copy_prop, pass_store_copy_prop): Declare. * tree-ssa-dom.c (struct opt_stats_d): Add fields 'num_const_prop' and 'num_copy_prop'. (cprop_operand): Update them. (dump_dominator_optimization_stats): Dump them. (tree_ssa_dominator_optimize): Call update_ssa instead of rewrite_into_ssa. (loop_depth_of_name): Declare extern. (simplify_cond_and_lookup_avail_expr): Guard against NULL values for LOW or HIGH. (cprop_into_successor_phis): Only propagate if NEW != ORIG. (record_equivalences_from_stmt): Call expr_computes_nonzero. (cprop_operand): Only propagate if VAL != OP. * tree-ssa-dse.c (dse_optimize_stmt): Mark symbols in removed statement for renaming. * tree-ssa-loop-im.c (move_computations): Call update_ssa. * tree-ssa-loop-ivopts.c (rewrite_address_base): Call add_type_alias if necessary. Call mark_new_vars_to_rename. (tree_ssa_iv_optimize): If new symbols need to be renamed, mark every statement updated, call update_ssa and rewrite_into_loop_closed_ssa. * tree-ssa-loop-manip.c (add_exit_phis): Do not remove DEF_BB from LIVEIN if VAR is a virtual. * tree-ssa-loop.c (tree_loop_optimizer_init): Call update_ssa. * tree-ssa-operands.c (get_expr_operands): Handle ASSERT_EXPR. (get_call_expr_operands): Reformat statement. (add_stmt_operand): Don't create V_MAY_DEFs for read-only symbols. * tree-ssa-propagate.c (ssa_prop_init): Initialize SSA_NAME_VALUE for every name. (first_vdef, stmt_makes_single_load, stmt_makes_single_store, get_value_loaded_by): New. (replace_uses_in, replace_vuses_in, replace_phi_args_in, substitute_and_fold): Move from tree-ssa-ccp.c. * tree-ssa-propagate.h (struct prop_value_d, prop_value_t, first_vdef, stmt_makes_single_load, stmt_makes_single_store, get_value_loaded_by, replace_uses_in, substitute_and_fold): Declare. * tree-ssa.c (verify_use): Fix error message. (propagate_into_addr, replace_immediate_uses, get_eq_name, check_phi_redundancy, kill_redundant_phi_nodes, pass_redundant_phi): Remove. Update all users. * tree-vect-transform.c (vect_create_data_ref_ptr): Call add_type_alias, if necessary. * tree-vectorizer.h (struct _stmt_vect_info): Update documentation for field 'memtag'. * tree-vrp.c: New file. * tree.def (ASSERT_EXPR): Define. * tree.h (ASSERT_EXPR_VAR): Define. (ASSERT_EXPR_COND): Define. (SSA_NAME_VALUE_RANGE): Define. (struct tree_ssa_name): Add field 'value_range'. (PHI_REWRITTEN): Remove. (struct tree_phi_node): Remove field 'rewritten'. * doc/invoke.texi (-fdump-tree-storeccp, -ftree-copy-prop, -ftree-store-copy-prop): Document. * doc/tree-ssa.texi: Remove broken link to McCAT's compiler. Document usage of update_ssa. testsuite/ChangeLog * g++.dg/tree-ssa/pr18178.C: New test. * gcc.c-torture/execute/20030216-1.x: Ignore at -O1. * gcc.c-torture/execute/20041019-1.c: New test. * gcc.dg/tree-ssa/20041008-1.c: New test. * gcc.dg/tree-ssa/ssa-ccp-12.c: New test. * gcc.dg/tree-ssa/20030731-2.c: Update to use -fdump-tree-store_ccp. * gcc.dg/tree-ssa/20030917-1.c: Likewise. * gcc.dg/tree-ssa/20030917-3.c: Likewise. * gcc.dg/tree-ssa/20040721-1.c: Likewise. * gcc.dg/tree-ssa/ssa-ccp-1.c: Likewise. * gcc.dg/tree-ssa/ssa-ccp-2.c: Likewise. * gcc.dg/tree-ssa/ssa-ccp-3.c: Likewise. * gcc.dg/tree-ssa/ssa-ccp-7.c: Likewise. * gcc.dg/tree-ssa/ssa-ccp-9.c: Likewise. From-SVN: r97884
976 lines
27 KiB
C
976 lines
27 KiB
C
/* Optimization of PHI nodes by converting them into straightline code.
|
|
Copyright (C) 2004, 2005 Free Software Foundation, Inc.
|
|
|
|
This file is part of GCC.
|
|
|
|
GCC is free software; you can redistribute it and/or modify it
|
|
under the terms of the GNU General Public License as published by the
|
|
Free Software Foundation; either version 2, or (at your option) any
|
|
later version.
|
|
|
|
GCC is distributed in the hope that it will be useful, but WITHOUT
|
|
ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or
|
|
FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License
|
|
for more details.
|
|
|
|
You should have received a copy of the GNU General Public License
|
|
along with GCC; see the file COPYING. If not, write to the Free
|
|
Software Foundation, 59 Temple Place - Suite 330, Boston, MA
|
|
02111-1307, USA. */
|
|
|
|
#include "config.h"
|
|
#include "system.h"
|
|
#include "coretypes.h"
|
|
#include "tm.h"
|
|
#include "errors.h"
|
|
#include "ggc.h"
|
|
#include "tree.h"
|
|
#include "rtl.h"
|
|
#include "flags.h"
|
|
#include "tm_p.h"
|
|
#include "basic-block.h"
|
|
#include "timevar.h"
|
|
#include "diagnostic.h"
|
|
#include "tree-flow.h"
|
|
#include "tree-pass.h"
|
|
#include "tree-dump.h"
|
|
#include "langhooks.h"
|
|
|
|
static void tree_ssa_phiopt (void);
|
|
static bool conditional_replacement (basic_block, basic_block, basic_block,
|
|
edge, edge, tree, tree, tree);
|
|
static bool value_replacement (basic_block, basic_block, basic_block,
|
|
edge, edge, tree, tree, tree);
|
|
static bool minmax_replacement (basic_block, basic_block, basic_block,
|
|
edge, edge, tree, tree, tree);
|
|
static bool abs_replacement (basic_block, basic_block, basic_block,
|
|
edge, edge, tree, tree, tree);
|
|
static void replace_phi_edge_with_variable (basic_block, basic_block, edge,
|
|
tree, tree);
|
|
static basic_block *blocks_in_phiopt_order (void);
|
|
|
|
/* This pass eliminates PHI nodes which can be trivially implemented as
|
|
an assignment from a conditional expression. i.e. if we have something
|
|
like:
|
|
|
|
bb0:
|
|
if (cond) goto bb2; else goto bb1;
|
|
bb1:
|
|
bb2:
|
|
x = PHI (0 (bb1), 1 (bb0)
|
|
|
|
We can rewrite that as:
|
|
|
|
bb0:
|
|
bb1:
|
|
bb2:
|
|
x = cond;
|
|
|
|
bb1 will become unreachable and bb0 and bb2 will almost always
|
|
be merged into a single block. This occurs often due to gimplification
|
|
of conditionals.
|
|
|
|
Also done is the following optimization:
|
|
|
|
bb0:
|
|
if (a != b) goto bb2; else goto bb1;
|
|
bb1:
|
|
bb2:
|
|
x = PHI (a (bb1), b (bb0))
|
|
|
|
We can rewrite that as:
|
|
|
|
bb0:
|
|
bb1:
|
|
bb2:
|
|
x = b;
|
|
|
|
This can sometimes occur as a result of other optimizations. A
|
|
similar transformation is done by the ifcvt RTL optimizer.
|
|
|
|
This pass also eliminates PHI nodes which are really absolute
|
|
values. i.e. if we have something like:
|
|
|
|
bb0:
|
|
if (a >= 0) goto bb2; else goto bb1;
|
|
bb1:
|
|
x = -a;
|
|
bb2:
|
|
x = PHI (x (bb1), a (bb0));
|
|
|
|
We can rewrite that as:
|
|
|
|
bb0:
|
|
bb1:
|
|
bb2:
|
|
x = ABS_EXPR< a >;
|
|
|
|
Similarly,
|
|
|
|
bb0:
|
|
if (a <= b) goto bb2; else goto bb1;
|
|
bb1:
|
|
goto bb2;
|
|
bb2:
|
|
x = PHI (b (bb1), a (bb0));
|
|
|
|
Becomes
|
|
|
|
x = MIN_EXPR (a, b)
|
|
|
|
And the same transformation for MAX_EXPR.
|
|
|
|
bb1 will become unreachable and bb0 and bb2 will almost always be merged
|
|
into a single block. Similar transformations are done by the ifcvt
|
|
RTL optimizer. */
|
|
|
|
static void
|
|
tree_ssa_phiopt (void)
|
|
{
|
|
basic_block bb;
|
|
basic_block *bb_order;
|
|
unsigned n, i;
|
|
|
|
/* Search every basic block for COND_EXPR we may be able to optimize.
|
|
|
|
We walk the blocks in order that guarantees that a block with
|
|
a single predecessor is processed before the predecessor.
|
|
This ensures that we collapse inner ifs before visiting the
|
|
outer ones, and also that we do not try to visit a removed
|
|
block. */
|
|
bb_order = blocks_in_phiopt_order ();
|
|
n = n_basic_blocks;
|
|
|
|
for (i = 0; i < n; i++)
|
|
{
|
|
tree cond_expr;
|
|
tree phi;
|
|
basic_block bb1, bb2;
|
|
edge e1, e2;
|
|
tree arg0, arg1;
|
|
|
|
bb = bb_order[i];
|
|
|
|
cond_expr = last_stmt (bb);
|
|
/* Check to see if the last statement is a COND_EXPR. */
|
|
if (!cond_expr
|
|
|| TREE_CODE (cond_expr) != COND_EXPR)
|
|
continue;
|
|
|
|
e1 = EDGE_SUCC (bb, 0);
|
|
bb1 = e1->dest;
|
|
e2 = EDGE_SUCC (bb, 1);
|
|
bb2 = e2->dest;
|
|
|
|
/* We cannot do the optimization on abnormal edges. */
|
|
if ((e1->flags & EDGE_ABNORMAL) != 0
|
|
|| (e2->flags & EDGE_ABNORMAL) != 0)
|
|
continue;
|
|
|
|
/* If either bb1's succ or bb2 or bb2's succ is non NULL. */
|
|
if (EDGE_COUNT (bb1->succs) == 0
|
|
|| bb2 == NULL
|
|
|| EDGE_COUNT (bb2->succs) == 0)
|
|
continue;
|
|
|
|
/* Find the bb which is the fall through to the other. */
|
|
if (EDGE_SUCC (bb1, 0)->dest == bb2)
|
|
;
|
|
else if (EDGE_SUCC (bb2, 0)->dest == bb1)
|
|
{
|
|
basic_block bb_tmp = bb1;
|
|
edge e_tmp = e1;
|
|
bb1 = bb2;
|
|
bb2 = bb_tmp;
|
|
e1 = e2;
|
|
e2 = e_tmp;
|
|
}
|
|
else
|
|
continue;
|
|
|
|
e1 = EDGE_SUCC (bb1, 0);
|
|
|
|
/* Make sure that bb1 is just a fall through. */
|
|
if (!single_succ_p (bb1) > 1
|
|
|| (e1->flags & EDGE_FALLTHRU) == 0)
|
|
continue;
|
|
|
|
/* Also make that bb1 only have one pred and it is bb. */
|
|
if (!single_pred_p (bb1)
|
|
|| single_pred (bb1) != bb)
|
|
continue;
|
|
|
|
phi = phi_nodes (bb2);
|
|
|
|
/* Check to make sure that there is only one PHI node.
|
|
TODO: we could do it with more than one iff the other PHI nodes
|
|
have the same elements for these two edges. */
|
|
if (!phi || PHI_CHAIN (phi) != NULL)
|
|
continue;
|
|
|
|
arg0 = PHI_ARG_DEF_TREE (phi, e1->dest_idx);
|
|
arg1 = PHI_ARG_DEF_TREE (phi, e2->dest_idx);
|
|
|
|
/* We know something is wrong if we cannot find the edges in the PHI
|
|
node. */
|
|
gcc_assert (arg0 != NULL && arg1 != NULL);
|
|
|
|
/* Do the replacement of conditional if it can be done. */
|
|
if (conditional_replacement (bb, bb1, bb2, e1, e2, phi, arg0, arg1))
|
|
;
|
|
else if (value_replacement (bb, bb1, bb2, e1, e2, phi, arg0, arg1))
|
|
;
|
|
else if (abs_replacement (bb, bb1, bb2, e1, e2, phi, arg0, arg1))
|
|
;
|
|
else
|
|
minmax_replacement (bb, bb1, bb2, e1, e2, phi, arg0, arg1);
|
|
}
|
|
|
|
free (bb_order);
|
|
}
|
|
|
|
/* Returns the list of basic blocks in the function in an order that guarantees
|
|
that if a block X has just a single predecessor Y, then Y is after X in the
|
|
ordering. */
|
|
|
|
static basic_block *
|
|
blocks_in_phiopt_order (void)
|
|
{
|
|
basic_block x, y;
|
|
basic_block *order = xmalloc (sizeof (basic_block) * n_basic_blocks);
|
|
unsigned n = n_basic_blocks, np, i;
|
|
sbitmap visited = sbitmap_alloc (last_basic_block + 2);
|
|
|
|
#define MARK_VISITED(BB) (SET_BIT (visited, (BB)->index + 2))
|
|
#define VISITED_P(BB) (TEST_BIT (visited, (BB)->index + 2))
|
|
|
|
sbitmap_zero (visited);
|
|
|
|
MARK_VISITED (ENTRY_BLOCK_PTR);
|
|
FOR_EACH_BB (x)
|
|
{
|
|
if (VISITED_P (x))
|
|
continue;
|
|
|
|
/* Walk the predecessors of x as long as they have precisely one
|
|
predecessor and add them to the list, so that they get stored
|
|
after x. */
|
|
for (y = x, np = 1;
|
|
single_pred_p (y) && !VISITED_P (single_pred (y));
|
|
y = single_pred (y))
|
|
np++;
|
|
for (y = x, i = n - np;
|
|
single_pred_p (y) && !VISITED_P (single_pred (y));
|
|
y = single_pred (y), i++)
|
|
{
|
|
order[i] = y;
|
|
MARK_VISITED (y);
|
|
}
|
|
order[i] = y;
|
|
MARK_VISITED (y);
|
|
|
|
gcc_assert (i == n - 1);
|
|
n -= np;
|
|
}
|
|
|
|
sbitmap_free (visited);
|
|
gcc_assert (n == 0);
|
|
return order;
|
|
|
|
#undef MARK_VISITED
|
|
#undef VISITED_P
|
|
}
|
|
|
|
/* Return TRUE if block BB has no executable statements, otherwise return
|
|
FALSE. */
|
|
bool
|
|
empty_block_p (basic_block bb)
|
|
{
|
|
block_stmt_iterator bsi;
|
|
|
|
/* BB must have no executable statements. */
|
|
bsi = bsi_start (bb);
|
|
while (!bsi_end_p (bsi)
|
|
&& (TREE_CODE (bsi_stmt (bsi)) == LABEL_EXPR
|
|
|| IS_EMPTY_STMT (bsi_stmt (bsi))))
|
|
bsi_next (&bsi);
|
|
|
|
if (!bsi_end_p (bsi))
|
|
return false;
|
|
|
|
return true;
|
|
}
|
|
|
|
/* Replace PHI node element whoes edge is E in block BB with variable NEW.
|
|
Remove the edge from COND_BLOCK which does not lead to BB (COND_BLOCK
|
|
is known to have two edges, one of which must reach BB). */
|
|
|
|
static void
|
|
replace_phi_edge_with_variable (basic_block cond_block, basic_block bb,
|
|
edge e, tree phi, tree new)
|
|
{
|
|
basic_block block_to_remove;
|
|
block_stmt_iterator bsi;
|
|
|
|
/* Change the PHI argument to new. */
|
|
SET_USE (PHI_ARG_DEF_PTR (phi, e->dest_idx), new);
|
|
|
|
/* Remove the empty basic block. */
|
|
if (EDGE_SUCC (cond_block, 0)->dest == bb)
|
|
{
|
|
EDGE_SUCC (cond_block, 0)->flags |= EDGE_FALLTHRU;
|
|
EDGE_SUCC (cond_block, 0)->flags &= ~(EDGE_TRUE_VALUE | EDGE_FALSE_VALUE);
|
|
|
|
block_to_remove = EDGE_SUCC (cond_block, 1)->dest;
|
|
}
|
|
else
|
|
{
|
|
EDGE_SUCC (cond_block, 1)->flags |= EDGE_FALLTHRU;
|
|
EDGE_SUCC (cond_block, 1)->flags
|
|
&= ~(EDGE_TRUE_VALUE | EDGE_FALSE_VALUE);
|
|
|
|
block_to_remove = EDGE_SUCC (cond_block, 0)->dest;
|
|
}
|
|
delete_basic_block (block_to_remove);
|
|
|
|
/* Eliminate the COND_EXPR at the end of COND_BLOCK. */
|
|
bsi = bsi_last (cond_block);
|
|
bsi_remove (&bsi);
|
|
|
|
if (dump_file && (dump_flags & TDF_DETAILS))
|
|
fprintf (dump_file,
|
|
"COND_EXPR in block %d and PHI in block %d converted to straightline code.\n",
|
|
cond_block->index,
|
|
bb->index);
|
|
}
|
|
|
|
/* The function conditional_replacement does the main work of doing the
|
|
conditional replacement. Return true if the replacement is done.
|
|
Otherwise return false.
|
|
BB is the basic block where the replacement is going to be done on. ARG0
|
|
is argument 0 from PHI. Likewise for ARG1. */
|
|
|
|
static bool
|
|
conditional_replacement (basic_block cond_bb, basic_block middle_bb,
|
|
basic_block phi_bb, edge e0, edge e1, tree phi,
|
|
tree arg0, tree arg1)
|
|
{
|
|
tree result;
|
|
tree old_result = NULL;
|
|
tree new, cond;
|
|
block_stmt_iterator bsi;
|
|
edge true_edge, false_edge;
|
|
tree new_var = NULL;
|
|
tree new_var1;
|
|
|
|
/* The PHI arguments have the constants 0 and 1, then convert
|
|
it to the conditional. */
|
|
if ((integer_zerop (arg0) && integer_onep (arg1))
|
|
|| (integer_zerop (arg1) && integer_onep (arg0)))
|
|
;
|
|
else
|
|
return false;
|
|
|
|
if (!empty_block_p (middle_bb))
|
|
return false;
|
|
|
|
/* If the condition is not a naked SSA_NAME and its type does not
|
|
match the type of the result, then we have to create a new
|
|
variable to optimize this case as it would likely create
|
|
non-gimple code when the condition was converted to the
|
|
result's type. */
|
|
cond = COND_EXPR_COND (last_stmt (cond_bb));
|
|
result = PHI_RESULT (phi);
|
|
if (TREE_CODE (cond) != SSA_NAME
|
|
&& !lang_hooks.types_compatible_p (TREE_TYPE (cond), TREE_TYPE (result)))
|
|
{
|
|
new_var = make_rename_temp (TREE_TYPE (cond), NULL);
|
|
old_result = cond;
|
|
cond = new_var;
|
|
}
|
|
|
|
/* If the condition was a naked SSA_NAME and the type is not the
|
|
same as the type of the result, then convert the type of the
|
|
condition. */
|
|
if (!lang_hooks.types_compatible_p (TREE_TYPE (cond), TREE_TYPE (result)))
|
|
cond = fold_convert (TREE_TYPE (result), cond);
|
|
|
|
/* We need to know which is the true edge and which is the false
|
|
edge so that we know when to invert the condition below. */
|
|
extract_true_false_edges_from_block (cond_bb, &true_edge, &false_edge);
|
|
|
|
/* Insert our new statement at the end of conditional block before the
|
|
COND_EXPR. */
|
|
bsi = bsi_last (cond_bb);
|
|
bsi_insert_before (&bsi, build_empty_stmt (), BSI_NEW_STMT);
|
|
|
|
if (old_result)
|
|
{
|
|
tree new1;
|
|
if (!COMPARISON_CLASS_P (old_result))
|
|
return false;
|
|
|
|
new1 = build2 (TREE_CODE (old_result), TREE_TYPE (old_result),
|
|
TREE_OPERAND (old_result, 0),
|
|
TREE_OPERAND (old_result, 1));
|
|
|
|
new1 = build2 (MODIFY_EXPR, TREE_TYPE (old_result), new_var, new1);
|
|
bsi_insert_after (&bsi, new1, BSI_NEW_STMT);
|
|
}
|
|
|
|
new_var1 = duplicate_ssa_name (PHI_RESULT (phi), NULL);
|
|
|
|
|
|
/* At this point we know we have a COND_EXPR with two successors.
|
|
One successor is BB, the other successor is an empty block which
|
|
falls through into BB.
|
|
|
|
There is a single PHI node at the join point (BB) and its arguments
|
|
are constants (0, 1).
|
|
|
|
So, given the condition COND, and the two PHI arguments, we can
|
|
rewrite this PHI into non-branching code:
|
|
|
|
dest = (COND) or dest = COND'
|
|
|
|
We use the condition as-is if the argument associated with the
|
|
true edge has the value one or the argument associated with the
|
|
false edge as the value zero. Note that those conditions are not
|
|
the same since only one of the outgoing edges from the COND_EXPR
|
|
will directly reach BB and thus be associated with an argument. */
|
|
if ((e0 == true_edge && integer_onep (arg0))
|
|
|| (e0 == false_edge && integer_zerop (arg0))
|
|
|| (e1 == true_edge && integer_onep (arg1))
|
|
|| (e1 == false_edge && integer_zerop (arg1)))
|
|
{
|
|
new = build2 (MODIFY_EXPR, TREE_TYPE (new_var1), new_var1, cond);
|
|
}
|
|
else
|
|
{
|
|
tree cond1 = invert_truthvalue (cond);
|
|
|
|
cond = cond1;
|
|
/* If what we get back is a conditional expression, there is no
|
|
way that it can be gimple. */
|
|
if (TREE_CODE (cond) == COND_EXPR)
|
|
{
|
|
release_ssa_name (new_var1);
|
|
return false;
|
|
}
|
|
|
|
/* If what we get back is not gimple try to create it as gimple by
|
|
using a temporary variable. */
|
|
if (is_gimple_cast (cond)
|
|
&& !is_gimple_val (TREE_OPERAND (cond, 0)))
|
|
{
|
|
tree temp = TREE_OPERAND (cond, 0);
|
|
tree new_var_1 = make_rename_temp (TREE_TYPE (temp), NULL);
|
|
new = build2 (MODIFY_EXPR, TREE_TYPE (new_var_1), new_var_1, temp);
|
|
bsi_insert_after (&bsi, new, BSI_NEW_STMT);
|
|
cond = fold_convert (TREE_TYPE (result), new_var_1);
|
|
}
|
|
|
|
if (TREE_CODE (cond) == TRUTH_NOT_EXPR
|
|
&& !is_gimple_val (TREE_OPERAND (cond, 0)))
|
|
{
|
|
release_ssa_name (new_var1);
|
|
return false;
|
|
}
|
|
|
|
new = build2 (MODIFY_EXPR, TREE_TYPE (new_var1), new_var1, cond);
|
|
}
|
|
|
|
bsi_insert_after (&bsi, new, BSI_NEW_STMT);
|
|
|
|
SSA_NAME_DEF_STMT (new_var1) = new;
|
|
|
|
replace_phi_edge_with_variable (cond_bb, phi_bb, e1, phi, new_var1);
|
|
|
|
/* Note that we optimized this PHI. */
|
|
return true;
|
|
}
|
|
|
|
/* The function value_replacement does the main work of doing the value
|
|
replacement. Return true if the replacement is done. Otherwise return
|
|
false.
|
|
BB is the basic block where the replacement is going to be done on. ARG0
|
|
is argument 0 from the PHI. Likewise for ARG1. */
|
|
|
|
static bool
|
|
value_replacement (basic_block cond_bb, basic_block middle_bb,
|
|
basic_block phi_bb, edge e0, edge e1, tree phi,
|
|
tree arg0, tree arg1)
|
|
{
|
|
tree cond;
|
|
edge true_edge, false_edge;
|
|
|
|
/* If the type says honor signed zeros we cannot do this
|
|
optimization. */
|
|
if (HONOR_SIGNED_ZEROS (TYPE_MODE (TREE_TYPE (arg1))))
|
|
return false;
|
|
|
|
if (!empty_block_p (middle_bb))
|
|
return false;
|
|
|
|
cond = COND_EXPR_COND (last_stmt (cond_bb));
|
|
|
|
/* This transformation is only valid for equality comparisons. */
|
|
if (TREE_CODE (cond) != NE_EXPR && TREE_CODE (cond) != EQ_EXPR)
|
|
return false;
|
|
|
|
/* We need to know which is the true edge and which is the false
|
|
edge so that we know if have abs or negative abs. */
|
|
extract_true_false_edges_from_block (cond_bb, &true_edge, &false_edge);
|
|
|
|
/* At this point we know we have a COND_EXPR with two successors.
|
|
One successor is BB, the other successor is an empty block which
|
|
falls through into BB.
|
|
|
|
The condition for the COND_EXPR is known to be NE_EXPR or EQ_EXPR.
|
|
|
|
There is a single PHI node at the join point (BB) with two arguments.
|
|
|
|
We now need to verify that the two arguments in the PHI node match
|
|
the two arguments to the equality comparison. */
|
|
|
|
if ((operand_equal_for_phi_arg_p (arg0, TREE_OPERAND (cond, 0))
|
|
&& operand_equal_for_phi_arg_p (arg1, TREE_OPERAND (cond, 1)))
|
|
|| (operand_equal_for_phi_arg_p (arg1, TREE_OPERAND (cond, 0))
|
|
&& operand_equal_for_phi_arg_p (arg0, TREE_OPERAND (cond, 1))))
|
|
{
|
|
edge e;
|
|
tree arg;
|
|
|
|
/* For NE_EXPR, we want to build an assignment result = arg where
|
|
arg is the PHI argument associated with the true edge. For
|
|
EQ_EXPR we want the PHI argument associated with the false edge. */
|
|
e = (TREE_CODE (cond) == NE_EXPR ? true_edge : false_edge);
|
|
|
|
/* Unfortunately, E may not reach BB (it may instead have gone to
|
|
OTHER_BLOCK). If that is the case, then we want the single outgoing
|
|
edge from OTHER_BLOCK which reaches BB and represents the desired
|
|
path from COND_BLOCK. */
|
|
if (e->dest == middle_bb)
|
|
e = single_succ_edge (e->dest);
|
|
|
|
/* Now we know the incoming edge to BB that has the argument for the
|
|
RHS of our new assignment statement. */
|
|
if (e0 == e)
|
|
arg = arg0;
|
|
else
|
|
arg = arg1;
|
|
|
|
replace_phi_edge_with_variable (cond_bb, phi_bb, e1, phi, arg);
|
|
|
|
/* Note that we optimized this PHI. */
|
|
return true;
|
|
}
|
|
return false;
|
|
}
|
|
|
|
/* The function minmax_replacement does the main work of doing the minmax
|
|
replacement. Return true if the replacement is done. Otherwise return
|
|
false.
|
|
BB is the basic block where the replacement is going to be done on. ARG0
|
|
is argument 0 from the PHI. Likewise for ARG1. */
|
|
|
|
static bool
|
|
minmax_replacement (basic_block cond_bb, basic_block middle_bb,
|
|
basic_block phi_bb, edge e0, edge e1, tree phi,
|
|
tree arg0, tree arg1)
|
|
{
|
|
tree result, type;
|
|
tree cond, new;
|
|
edge true_edge, false_edge;
|
|
enum tree_code cmp, minmax, ass_code;
|
|
tree smaller, larger, arg_true, arg_false;
|
|
block_stmt_iterator bsi, bsi_from;
|
|
|
|
type = TREE_TYPE (PHI_RESULT (phi));
|
|
|
|
/* The optimization may be unsafe due to NaNs. */
|
|
if (HONOR_NANS (TYPE_MODE (type)))
|
|
return false;
|
|
|
|
cond = COND_EXPR_COND (last_stmt (cond_bb));
|
|
cmp = TREE_CODE (cond);
|
|
result = PHI_RESULT (phi);
|
|
|
|
/* This transformation is only valid for order comparisons. Record which
|
|
operand is smaller/larger if the result of the comparison is true. */
|
|
if (cmp == LT_EXPR || cmp == LE_EXPR)
|
|
{
|
|
smaller = TREE_OPERAND (cond, 0);
|
|
larger = TREE_OPERAND (cond, 1);
|
|
}
|
|
else if (cmp == GT_EXPR || cmp == GE_EXPR)
|
|
{
|
|
smaller = TREE_OPERAND (cond, 1);
|
|
larger = TREE_OPERAND (cond, 0);
|
|
}
|
|
else
|
|
return false;
|
|
|
|
/* We need to know which is the true edge and which is the false
|
|
edge so that we know if have abs or negative abs. */
|
|
extract_true_false_edges_from_block (cond_bb, &true_edge, &false_edge);
|
|
|
|
/* Forward the edges over the middle basic block. */
|
|
if (true_edge->dest == middle_bb)
|
|
true_edge = EDGE_SUCC (true_edge->dest, 0);
|
|
if (false_edge->dest == middle_bb)
|
|
false_edge = EDGE_SUCC (false_edge->dest, 0);
|
|
|
|
if (true_edge == e0)
|
|
{
|
|
gcc_assert (false_edge == e1);
|
|
arg_true = arg0;
|
|
arg_false = arg1;
|
|
}
|
|
else
|
|
{
|
|
gcc_assert (false_edge == e0);
|
|
gcc_assert (true_edge == e1);
|
|
arg_true = arg1;
|
|
arg_false = arg0;
|
|
}
|
|
|
|
if (empty_block_p (middle_bb))
|
|
{
|
|
if (operand_equal_for_phi_arg_p (arg_true, smaller)
|
|
&& operand_equal_for_phi_arg_p (arg_false, larger))
|
|
{
|
|
/* Case
|
|
|
|
if (smaller < larger)
|
|
rslt = smaller;
|
|
else
|
|
rslt = larger; */
|
|
minmax = MIN_EXPR;
|
|
}
|
|
else if (operand_equal_for_phi_arg_p (arg_false, smaller)
|
|
&& operand_equal_for_phi_arg_p (arg_true, larger))
|
|
minmax = MAX_EXPR;
|
|
else
|
|
return false;
|
|
}
|
|
else
|
|
{
|
|
/* Recognize the following case, assuming d <= u:
|
|
|
|
if (a <= u)
|
|
b = MAX (a, d);
|
|
x = PHI <b, u>
|
|
|
|
This is equivalent to
|
|
|
|
b = MAX (a, d);
|
|
x = MIN (b, u); */
|
|
|
|
tree assign = last_and_only_stmt (middle_bb);
|
|
tree lhs, rhs, op0, op1, bound;
|
|
|
|
if (!assign
|
|
|| TREE_CODE (assign) != MODIFY_EXPR)
|
|
return false;
|
|
|
|
lhs = TREE_OPERAND (assign, 0);
|
|
rhs = TREE_OPERAND (assign, 1);
|
|
ass_code = TREE_CODE (rhs);
|
|
if (ass_code != MAX_EXPR && ass_code != MIN_EXPR)
|
|
return false;
|
|
op0 = TREE_OPERAND (rhs, 0);
|
|
op1 = TREE_OPERAND (rhs, 1);
|
|
|
|
if (true_edge->src == middle_bb)
|
|
{
|
|
/* We got here if the condition is true, i.e., SMALLER < LARGER. */
|
|
if (!operand_equal_for_phi_arg_p (lhs, arg_true))
|
|
return false;
|
|
|
|
if (operand_equal_for_phi_arg_p (arg_false, larger))
|
|
{
|
|
/* Case
|
|
|
|
if (smaller < larger)
|
|
{
|
|
r' = MAX_EXPR (smaller, bound)
|
|
}
|
|
r = PHI <r', larger> --> to be turned to MIN_EXPR. */
|
|
if (ass_code != MAX_EXPR)
|
|
return false;
|
|
|
|
minmax = MIN_EXPR;
|
|
if (operand_equal_for_phi_arg_p (op0, smaller))
|
|
bound = op1;
|
|
else if (operand_equal_for_phi_arg_p (op1, smaller))
|
|
bound = op0;
|
|
else
|
|
return false;
|
|
|
|
/* We need BOUND <= LARGER. */
|
|
if (!integer_nonzerop (fold (build2 (LE_EXPR, boolean_type_node,
|
|
bound, larger))))
|
|
return false;
|
|
}
|
|
else if (operand_equal_for_phi_arg_p (arg_false, smaller))
|
|
{
|
|
/* Case
|
|
|
|
if (smaller < larger)
|
|
{
|
|
r' = MIN_EXPR (larger, bound)
|
|
}
|
|
r = PHI <r', smaller> --> to be turned to MAX_EXPR. */
|
|
if (ass_code != MIN_EXPR)
|
|
return false;
|
|
|
|
minmax = MAX_EXPR;
|
|
if (operand_equal_for_phi_arg_p (op0, larger))
|
|
bound = op1;
|
|
else if (operand_equal_for_phi_arg_p (op1, larger))
|
|
bound = op0;
|
|
else
|
|
return false;
|
|
|
|
/* We need BOUND >= SMALLER. */
|
|
if (!integer_nonzerop (fold (build2 (GE_EXPR, boolean_type_node,
|
|
bound, smaller))))
|
|
return false;
|
|
}
|
|
else
|
|
return false;
|
|
}
|
|
else
|
|
{
|
|
/* We got here if the condition is false, i.e., SMALLER > LARGER. */
|
|
if (!operand_equal_for_phi_arg_p (lhs, arg_false))
|
|
return false;
|
|
|
|
if (operand_equal_for_phi_arg_p (arg_true, larger))
|
|
{
|
|
/* Case
|
|
|
|
if (smaller > larger)
|
|
{
|
|
r' = MIN_EXPR (smaller, bound)
|
|
}
|
|
r = PHI <r', larger> --> to be turned to MAX_EXPR. */
|
|
if (ass_code != MIN_EXPR)
|
|
return false;
|
|
|
|
minmax = MAX_EXPR;
|
|
if (operand_equal_for_phi_arg_p (op0, smaller))
|
|
bound = op1;
|
|
else if (operand_equal_for_phi_arg_p (op1, smaller))
|
|
bound = op0;
|
|
else
|
|
return false;
|
|
|
|
/* We need BOUND >= LARGER. */
|
|
if (!integer_nonzerop (fold (build2 (GE_EXPR, boolean_type_node,
|
|
bound, larger))))
|
|
return false;
|
|
}
|
|
else if (operand_equal_for_phi_arg_p (arg_true, smaller))
|
|
{
|
|
/* Case
|
|
|
|
if (smaller > larger)
|
|
{
|
|
r' = MAX_EXPR (larger, bound)
|
|
}
|
|
r = PHI <r', smaller> --> to be turned to MIN_EXPR. */
|
|
if (ass_code != MAX_EXPR)
|
|
return false;
|
|
|
|
minmax = MIN_EXPR;
|
|
if (operand_equal_for_phi_arg_p (op0, larger))
|
|
bound = op1;
|
|
else if (operand_equal_for_phi_arg_p (op1, larger))
|
|
bound = op0;
|
|
else
|
|
return false;
|
|
|
|
/* We need BOUND <= SMALLER. */
|
|
if (!integer_nonzerop (fold (build2 (LE_EXPR, boolean_type_node,
|
|
bound, smaller))))
|
|
return false;
|
|
}
|
|
else
|
|
return false;
|
|
}
|
|
|
|
/* Move the statement from the middle block. */
|
|
bsi = bsi_last (cond_bb);
|
|
bsi_from = bsi_last (middle_bb);
|
|
bsi_move_before (&bsi_from, &bsi);
|
|
}
|
|
|
|
/* Emit the statement to compute min/max. */
|
|
result = duplicate_ssa_name (PHI_RESULT (phi), NULL);
|
|
new = build2 (MODIFY_EXPR, type, result,
|
|
build2 (minmax, type, arg0, arg1));
|
|
SSA_NAME_DEF_STMT (result) = new;
|
|
bsi = bsi_last (cond_bb);
|
|
bsi_insert_before (&bsi, new, BSI_NEW_STMT);
|
|
|
|
replace_phi_edge_with_variable (cond_bb, phi_bb, e1, phi, result);
|
|
return true;
|
|
}
|
|
|
|
/* The function absolute_replacement does the main work of doing the absolute
|
|
replacement. Return true if the replacement is done. Otherwise return
|
|
false.
|
|
bb is the basic block where the replacement is going to be done on. arg0
|
|
is argument 0 from the phi. Likewise for arg1. */
|
|
|
|
static bool
|
|
abs_replacement (basic_block cond_bb, basic_block middle_bb,
|
|
basic_block phi_bb, edge e0 ATTRIBUTE_UNUSED, edge e1,
|
|
tree phi, tree arg0, tree arg1)
|
|
{
|
|
tree result;
|
|
tree new, cond;
|
|
block_stmt_iterator bsi;
|
|
edge true_edge, false_edge;
|
|
tree assign;
|
|
edge e;
|
|
tree rhs, lhs;
|
|
bool negate;
|
|
enum tree_code cond_code;
|
|
|
|
/* If the type says honor signed zeros we cannot do this
|
|
optimization. */
|
|
if (HONOR_SIGNED_ZEROS (TYPE_MODE (TREE_TYPE (arg1))))
|
|
return false;
|
|
|
|
/* OTHER_BLOCK must have only one executable statement which must have the
|
|
form arg0 = -arg1 or arg1 = -arg0. */
|
|
|
|
assign = last_and_only_stmt (middle_bb);
|
|
/* If we did not find the proper negation assignment, then we can not
|
|
optimize. */
|
|
if (assign == NULL)
|
|
return false;
|
|
|
|
/* If we got here, then we have found the only executable statement
|
|
in OTHER_BLOCK. If it is anything other than arg = -arg1 or
|
|
arg1 = -arg0, then we can not optimize. */
|
|
if (TREE_CODE (assign) != MODIFY_EXPR)
|
|
return false;
|
|
|
|
lhs = TREE_OPERAND (assign, 0);
|
|
rhs = TREE_OPERAND (assign, 1);
|
|
|
|
if (TREE_CODE (rhs) != NEGATE_EXPR)
|
|
return false;
|
|
|
|
rhs = TREE_OPERAND (rhs, 0);
|
|
|
|
/* The assignment has to be arg0 = -arg1 or arg1 = -arg0. */
|
|
if (!(lhs == arg0 && rhs == arg1)
|
|
&& !(lhs == arg1 && rhs == arg0))
|
|
return false;
|
|
|
|
cond = COND_EXPR_COND (last_stmt (cond_bb));
|
|
result = PHI_RESULT (phi);
|
|
|
|
/* Only relationals comparing arg[01] against zero are interesting. */
|
|
cond_code = TREE_CODE (cond);
|
|
if (cond_code != GT_EXPR && cond_code != GE_EXPR
|
|
&& cond_code != LT_EXPR && cond_code != LE_EXPR)
|
|
return false;
|
|
|
|
/* Make sure the conditional is arg[01] OP y. */
|
|
if (TREE_OPERAND (cond, 0) != rhs)
|
|
return false;
|
|
|
|
if (FLOAT_TYPE_P (TREE_TYPE (TREE_OPERAND (cond, 1)))
|
|
? real_zerop (TREE_OPERAND (cond, 1))
|
|
: integer_zerop (TREE_OPERAND (cond, 1)))
|
|
;
|
|
else
|
|
return false;
|
|
|
|
/* We need to know which is the true edge and which is the false
|
|
edge so that we know if have abs or negative abs. */
|
|
extract_true_false_edges_from_block (cond_bb, &true_edge, &false_edge);
|
|
|
|
/* For GT_EXPR/GE_EXPR, if the true edge goes to OTHER_BLOCK, then we
|
|
will need to negate the result. Similarly for LT_EXPR/LE_EXPR if
|
|
the false edge goes to OTHER_BLOCK. */
|
|
if (cond_code == GT_EXPR || cond_code == GE_EXPR)
|
|
e = true_edge;
|
|
else
|
|
e = false_edge;
|
|
|
|
if (e->dest == middle_bb)
|
|
negate = true;
|
|
else
|
|
negate = false;
|
|
|
|
result = duplicate_ssa_name (result, NULL);
|
|
|
|
if (negate)
|
|
lhs = make_rename_temp (TREE_TYPE (result), NULL);
|
|
else
|
|
lhs = result;
|
|
|
|
/* Build the modify expression with abs expression. */
|
|
new = build2 (MODIFY_EXPR, TREE_TYPE (lhs),
|
|
lhs, build1 (ABS_EXPR, TREE_TYPE (lhs), rhs));
|
|
|
|
bsi = bsi_last (cond_bb);
|
|
bsi_insert_before (&bsi, new, BSI_NEW_STMT);
|
|
|
|
if (negate)
|
|
{
|
|
/* Get the right BSI. We want to insert after the recently
|
|
added ABS_EXPR statement (which we know is the first statement
|
|
in the block. */
|
|
new = build2 (MODIFY_EXPR, TREE_TYPE (result),
|
|
result, build1 (NEGATE_EXPR, TREE_TYPE (lhs), lhs));
|
|
|
|
bsi_insert_after (&bsi, new, BSI_NEW_STMT);
|
|
}
|
|
|
|
SSA_NAME_DEF_STMT (result) = new;
|
|
replace_phi_edge_with_variable (cond_bb, phi_bb, e1, phi, result);
|
|
|
|
/* Note that we optimized this PHI. */
|
|
return true;
|
|
}
|
|
|
|
|
|
/* Always do these optimizations if we have SSA
|
|
trees to work on. */
|
|
static bool
|
|
gate_phiopt (void)
|
|
{
|
|
return 1;
|
|
}
|
|
|
|
struct tree_opt_pass pass_phiopt =
|
|
{
|
|
"phiopt", /* name */
|
|
gate_phiopt, /* gate */
|
|
tree_ssa_phiopt, /* execute */
|
|
NULL, /* sub */
|
|
NULL, /* next */
|
|
0, /* static_pass_number */
|
|
TV_TREE_PHIOPT, /* tv_id */
|
|
PROP_cfg | PROP_ssa | PROP_alias, /* properties_required */
|
|
0, /* properties_provided */
|
|
0, /* properties_destroyed */
|
|
0, /* todo_flags_start */
|
|
TODO_cleanup_cfg
|
|
| TODO_dump_func
|
|
| TODO_ggc_collect
|
|
| TODO_verify_ssa
|
|
| TODO_update_ssa
|
|
| TODO_verify_flow
|
|
| TODO_verify_stmts, /* todo_flags_finish */
|
|
0 /* letter */
|
|
};
|