2006-12-11  Diego Novillo  <dnovillo@redhat.com>

	* doc/tree-ssa.texi: Update documentation for virtual operands and the use of push_stmt_changes/pop_stmt_changes.
	* doc/invoke.texi: Remove documentation for params global-var-threshold. Update documentation on max-aliased-vops.
	* tree-into-ssa.c: Cleanup comments, variables and spacing in various functions.
	(regs_to_rename): Declare.
	(mem_syms_to_rename): Declare.
	(dump_update_ssa): Declare.
	(debug_update_ssa): Declare.
	(dump_names_replaced_by): Declare.
	(debug_names_replaced_by): Declare.
	(dump_def_blocks): Declare.
	(debug_def_blocks): Declare.
	(dump_defs_stack): Declare.
	(debug_defs_stack): Declare.
	(dump_currdefs): Declare.
	(debug_currdefs): Declare.
	(mark_def_sites): Do not handle virtual operands.
	(compute_idf): Rename from find_idf. Update users.
	(register_new_def): Make local. Convert second argument to 'tree'. Use BLOCK_DEFS_STACK directly. If pushing a non-register, also push the underlying symbol.
	(rewrite_stmt): Do not handle virtual operands.
	(dump_tree_ssa): Call dump_def_blocks, dump_defs_stack, dump_currdefs and dump_tree_ssa_stats.
	(dump_tree_ssa_stats): Also dump REPL_TBL.
	(replace_use): Remove. Update all users to call SET_USE instead.
	(rewrite_blocks): Move code to free memory to fini_ssa_renamer.
	(mark_def_site_blocks): Move initialization code to init_ssa_renamer.
	(init_ssa_renamer): New.
	(fini_ssa_renamer): New.
	(rewrite_into_ssa): Call them.
	(prepare_block_for_update): Process SSA_OP_ALL_USES first and SSA_OP_ALL_DEFS later. Do not process virtual operands separately.
	(dump_update_ssa): Call dump_decl_set.
	(init_update_ssa): Initialize regs_to_rename and mem_syms_to_rename. Call init_ssa_renamer.
	(delete_update_ssa): Call fini_ssa_renamer. Free blocks_with_phis_to_rewrite.
	(mark_sym_for_renaming): If the variable has sub-variables, also mark them. If the variable belongs to a partition, also mark it.
	(mark_set_for_renaming): Call mark_sym_for_renaming on every symbol in the set.
	(switch_virtuals_to_full_rewrite): Call mark_set_for_renaming.
	(update_ssa): Separate syms_to_rename into regs_to_rename and mem_syms_to_rename.
	* tree-dump.c (dump_options): Add TDF_MEMSYMS.
	* tree-pretty-print.c (debug_generic_expr): Add TDF_MEMSYMS.
	(debug_generic_stmt): Likewise.
	(debug_tree_chain): Likewise.
	(dump_symbols): New.
	(dump_generic_node): Check for TDF_MEMSYMS. Handle MEMORY_PARTITION_TAG. If the statement references memory and TDF_MEMSYMS is given, call dump_symbols. Indicate default names with (D).
	(dump_vops): Update for new virtual operator format.
	* tree.c (init_ttree): Add MEMORY_PARTITION_TAG to tree_contains_struct.
	(tree_code_size): Handle MEMORY_PARTITION_TAG.
	(tree_node_structure): Likewise.
	(needs_to_live_in_memory): Handle SSA names.
	* tree.h (MTAG_P): Likewise.
	(struct tree_memory_partition_tag): Declare.
	(MPT_SYMBOLS): Define.
	(union tree_node): Add field 'mpt'.
	* treestruct.def (TS_MEMORY_PARTITION_TAG): Define.
	* tree.def (MEMORY_PARTITION_TAG): Define.
	* tree-pass.h (TDF_MEMSYMS): Define.
	* params.h (GLOBAL_VAR_THRESHOLD): Remove.
	* tree-ssa-alias.c: Include pointer-set.h
	(struct alias_map_d): Remove fields total_alias_vops, grouped_p and may_aliases. Update all users.
	(struct mp_info_def): Declare.
	(mp_info_t): New type.
	(get_smt_for): Rename from get_tmt_for. Update all users.
	(add_may_alias): Add argument ALREADY_ADDED. If given, use it to avoid adding duplicate entries to alias sets.
	(replace_may_alias): Remove. Update all users.
	(total_alias_vops_cmp): Remove. Update all users.
	(group_aliases_into): Remove. Update all users.
	(tree_pointer_compare): Remove. Update all users.
	(compact_name_tags): Remove. Update all users.
	(group_aliases): Remove. Update all users.
	(mark_non_addressable): Move from tree-flow-inline.h. Remove the symbol from the partition holding it, if needed.
	(dump_mp_info): New.
	(debug_mp_info): New.
	(sort_mp_info): New.
	(create_partition_for): New.
	(rewrite_alias_set_for): New.
	(compute_memory_partitions): New.
	(compute_may_aliases): Call it.
	(init_alias_info): If computing aliases for the first time, mark every memory symbol for renaming.
	(have_common_aliases_p): New.
	(compute_flow_insensitive_aliasing): Call it.
	(setup_pointers_and_addressables): Do not cache num_referenced_vars. For register promoted symbols, mark their former partition for renaming.
	(maybe_create_global_var): Only create .GLOBAL_VAR if there are no call-clobbered variables and a mix of pure and non-pure functions were found.
	(may_alias_p): Tidy comments.
	(create_tag_raw): Remove unused variable new_type.
	(dump_alias_info): call dump_memory_partitions.
	(dump_points_to_info_for): Call dump_decl_set.
	(may_be_aliased): Tidy comments and formatting.
	* timevar.def (TV_MEMORY_PARTITIONING): Define.
	* tree-vectorizer.c (vect_memsyms_to_rename): Rename from vect_vnames_to_rename. Set DECL_UIDs instead of SSA name versions in it.
	(slpeel_update_phi_nodes_for_guard1): Ignore memory PHIs.
	* tree-vect-transform.c (vect_transform_loop): Call mark_set_for_renaming with vect_memsyms_to_rename.
	* tree-flow-inline.h (zero_imm_uses_p): New.
	(memory_partition): New.
	(set_memory_partition): New.
	(factoring_name_p): New.
	(symbol_mem_tag): New. Update every function that used to access the annotation directly.
	(set_symbol_mem_tag): Likewise.
	* tree-ssa-copy.c (may_propagate_copy): Allow copies between a partition and a symbol as long as the symbol belongs to the partition.
	(merge_alias_info): Ignore merge requests when memory partitions are involved.
	* tree-ssa.c (verify_ssa_name): Check that default definitions have empty defining statements.
	(verify_use): Remove argument IS_VIRTUAL. Don't call verify_ssa_name.
	(verify_phi_args): Call verify_ssa_name.
	(verify_flow_insensitive_alias_info): Handle MPTs.
	(verify_flow_sensitive_alias_info): Likewise.
	(verify_name_tags): Likewise.
	(verify_call_clobbering): Likewise.
	(verify_ssa): Check for VOPs only after aliasing information is available. Check virtuals and real operands separately. Call verify_ssa_name on every operand.
	(stmt_references_memory_p): Move to tree-ssa-operands.c.
	(walk_use_def_chains_1): Guard against NULL PHI arguments.
	* tree-ssa-operands.c (stmt_references_memory_p): Move from tree-ssa.c.
	(get_mpt_for): New.
	(dump_memory_partitions): New.
	(debug_memory_partitions): New.
	* tree-flow.h (struct var_ann_d): Add field mpt.
	(struct stmt_ann_d): Add bitfield references_memory.
	* Makefile.in (tree-ssa-structalias.o): Include pointer-set.h
	(tree-ssa-alias.o): Likewise.
	* tree-ssa-structalias.c: (update_alias_info): Use STORED_SYMS to determine which variables are being written to by the store operation.
	* tree-ssa-structalias.h (struct alias_info)
	<total_alias_vops>: Remove. Update all users.
	<written_vars>: Change to a pointer set. Update all users.
	<dereferenced_ptrs_store>: Likewise.
	<dereferenced_ptrs_load>: Likewise.
	(NUM_REFERENCES): Remove. Update all users.
	(NUM_REFERENCES_CLEAR): Remove. Update all users.
	(NUM_REFERENCES_INC): Remove. Update all users.
	(NUM_REFERENCES_SET): Remove. Update all users.
	* params.def (PARAM_GLOBAL_VAR_THRESHOLD): Remove. Update all users.
	(PARAM_MAX_ALIASED_VOPS): Set to 10.
	* tree-ssanames.c (make_ssa_name): Initialize SSA_NAME_IS_DEFAULT_DEF to 0.

2006-12-11  Aldy Hernandez  <aldyh@redhat.com>

	* tree-ssa-dse.c (aggregate_vardecl_d): New.
	(dse_global_data): Add aggregate_vardecl field.
	(dse_possible_dead_store_p): New. Add prev_defvar variable. Allow immediate uses and previous immediate uses to differ if they are setting different parts of the whole.
	(get_aggregate_vardecl): New.
	(dse_record_partial_aggregate_store): New.
	(dse_whole_aggregate_clobbered_p): New.
	(dse_partial_kill_p): New.
	(dse_optimize_stmt): Abstract code checking a possible dead store into new function dse_possible_dead_store_p(). Call dse_maybe_record_aggregate_store(). When checking whether a STMT and its USE_STMT refer to the same memory address, check also for partial kills that clobber the whole. Move some variable definitions to the block where they are used.
	(aggregate_vardecl_hash): New.
	(aggregate_vardecl_eq): New.
	(aggregate_vardecl_free): New.
	(aggregate_whole_store_p): New.
	(tree_ssa_dse): Initialize and free aggregate_vardecl. Mark which aggregate stores we care about.

2006-12-11  Andrew Macleod  <amacleod@redhat.com>

	* tree-ssa-operands.h (struct vuse_element_d): Declare.
	(vuse_element_t): Declare.
	(struct vuse_vec_d): Declare.
	(vuse_vec_p): Declare.
	(VUSE_VECT_NUM_ELEM): Define.
	(VUSE_VECT_ELEMENT_NC): Define.
	(VUSE_ELEMENT_PTR_NC): Define.
	(VUSE_ELEMENT_VAR_NC): Define.
	(VUSE_VECT_ELEMENT): Define.
	(VUSE_ELEMENT_PTR): Define.
	(VUSE_ELEMENT_VAR): Define.
	(struct maydef_optype_d)
	<use_var>: Remove.
	<use_ptr>: Remove.
	<usev>: Add.
	(struct vuse_optype_d)
	<kill_var>: Remove.
	<use_ptr>: Remove.
	<usev>: Add.
	(struct mustdef_optype_d)
	<kill_var>: Remove.
	<use_ptr>: Remove.
	<usev>: Add.
	(VUSE_OP_PTR): Add argument. Use VUSE_ELEMENT_PTR.
	(VUSE_OP): Add argument. Use VUSE_ELEMENT_PTR.
	(VUSE_NUM): Define.
	(VUSE_VECT): Define.
	(MAYDEF_OP_PTR): Add argument. Use VUSE_OP_PTR.
	(MAYDEF_OP): Add argument. Use VUSE_OP.
	(MAYDEF_NUM): Define.
	(MAYDEF_VECT): Define.
	(MUSTDEF_KILL_PTR): Use VUSE_OP_PTR.
	(MUSTDEF_KILL): Use VUSE_OP.
	(MUSTDEF_NUM): Define.
	(MUSTDEF_VECT): Define.
	(realloc_maydef): Declare.
	(realloc_vuse): Declare.
	(struct ssa_operand_iterator_d)
	<vuse_index>: Add.
	<mayuse_index>: Add.
	(LOADED_SYMS): Define.
	(STORED_SYMS): Define.
	(FOR_EACH_SSA_MUSTDEF_OPERAND): Call op_iter_next_mustdef.
	* tree-into-ssa.c: Adapt for multi-operand V_MAY_DEF and VUSE operators.
	* tree-pretty-print.c: Likewise.
	* tree-ssa-dse.c: Likewise.
	* tree-flow-inline.h: Likewise.
	(op_iter_next_mustdef): New.
	* tree-ssa-operands.c: Likewise.
	(ALLOC_OPTYPE): Remove. Update all users.
	(alloc_def): New.
	(alloc_use): New.
	(alloc_maydef): New.
	(alloc_vuse): New.
	(alloc_mustdef): New.
	(realloc_maydef): New.
	(realloc_vuse): New.

2006-12-11  Aldy Hernandez  <aldyh@redhat.com>

	* tree-ssa-operands.c: Remove build_v_must_defs.
	(init_ssa_operands): Delete build_v_must_defs.
	(finalize_ssa_v_must_def_ops): Remove.
	(finalize_ssa_v_must_defs): Remove.
	(finalize_ssa_stmt_operands): Do not call finalize_ssa_v_must_defs.
	(start_ssa_stmt_operands): Do not check build_v_must_defs.
	(append_v_must_def): Delete.
	(copy_virtual_operands): Do not copy V_MUST_DEFs.
	(get_modify_expr_operands): Remove reference to V_MUST_DEF from comment. Remove opf_kill_def.
	(build_ssa_operands): Remove references to v_must_defs.
	(copy_virtual_operands): Same.
	(copy_virtual_operands): Same.
	(fini_ssa_operands): Same.
	(free_ssa_operands): Same.
	(add_mustdef_op): Remove. Remove mustdef_optype_p.
	(alloc_mustdef): Remove. Remove references to V_MUST_DEFs in comment at top of file.
	(get_expr_operands): Remove opf_kill_def.
	(opf_kill_def): Remove.
	(add_virtual_operand): Remove opf_kill_def.
	(get_indirect_ref_operands): Same.
	(get_tmr_operands): Same.
	* tree-vectorizer.c (rename_variables_in_bb): Remove SSA_OP_ALL_KILLS.
	* tree-ssa-loop-manip.c (find_uses_to_rename_stmt): Remove SSA_OP_ALL_KILLS.
	(check_loop_closed_ssa_stmt): Same.
	* tree-ssa.c (verify_def): Remove V_MUST_DEF from comment.
	(verify_use): Same.
	(verify_ssa): Remove V_MUST_DEFs traces.
	(verify_ssa): Remove SSA_OP_ALL_KILLS.
	* tree-into-ssa.c (mark_def_sites): Change SSA_OP_VMUSTDEF to SSA_OP_VMAYDEF.
	(rewrite_update_stmt): Remove SSA_OP_VIRTUAL_KILLS.
	(rewrite_stmt): Remove SSA_OP_ALL_KILLS.
	* tree-ssa-operands.h (struct stmt_operands_d): Remove V_MUST_DEF references.
	(MUSTDEF_OPS): Remove.
	(SSA_OP_VMUSTDEF): Remove.
	(FOR_EACH_SSA_MUSTDEF_OPERAND): Remove.
	(struct mustdef_optype_d): Remove. Remove mustdef_optype_p.
	(struct stmt_operands_d): Remove mustdef_ops.
	(ssa_operand_iterator_d): Remove mustdefs and mustkills.
	(SSA_OP_VIRTUAL_DEFS): Remove SSA_OP_VMUSTDEF.
	(MUSTDEF_RESULT_PTR): Remove.
	(MUSTDEF_RESULT): Remove.
	(MUSTDEF_KILL_PTR): Remove.
	(MUSTDEF_KILL): Remove.
	(MUSTDEF_NUM): Remove.
	(MUSTDEF_VECT): Remove.
	(SSA_OP_VIRTUAL_KILLS): Remove.
	(SSA_OP_ALL_VIRTUALS): Remove SSA_OP_VIRTUAL_KILLS.
	(SSA_OP_VMUSTKILL): Remove.
	(SSA_OP_ALL_KILLS): Remove.
	(SSA_OP_ALL_OPERANDS): Remove SSA_OP_ALL_KILLS.
	* tree-flow-inline.h (op_iter_init_def): Remove SSA_OP_VIRTUAL_KILLS.
	(delink_stmt_imm_use): Remove SSA_OP_ALL_KILLS.
	* tree-ssa-pre.c (compute_rvuse_and_antic_safe): Remove SSA_OP_VIRTUAL_KILLS.
	* tree-ssa-loop-im.c (determine_max_movement): Remove SSA_OP_VIRTUAL_KILLS.
	(gather_mem_refs_stmt): Same.
	(gather_mem_refs_stmt): Same.
	* tree-ssa-dce.c (mark_really_necessary_kill_operand_phis): Delete.
	(perform_tree_ssa_dce): Remove call to mark_really_necessary_kill_operand_phis.
	* tree-flow-inline.h (op_iter_init): Remove setting of mustdefs and mustkills.
	(op_iter_next_use): Do not check mustkills.
	(op_iter_next_def): Do not check mustdefs.
	(op_iter_next_tree): Do not check mustkills or mustdefs.
	(clear_and_done_ssa_iter): Do not set mustdefs or mustkills.
	(op_iter_next_maymustdef): Do not check mustkills.
	(op_iter_init_must_and_may_def): Remove SSA_OP_VMUSTKILL.
	(op_iter_init_mustdef): Remove.
	* tree-ssa-live.c (create_ssa_var_map): Change SSA_OP_VMUSTDEF to SSA_OP_VMAYDEF.
	* tree-ssa-dse.c (dse_optimize_stmt): Remove SSA_OP_VMUSTDEF.
	* tree-ssa-ccp.c: Remove V_MUST_DEF traces from comments.
	(visit_assignment): Same.
	* tree-ssa-copy.c (copy_prop_visit_assignment): Same.
	* tree-sra.c (mark_all_v_defs_1): Remove V_MUST_DEF from comment.
	* tree-outof-ssa.c (check_replaceable): Remove SSA_OP_VMUSTDEF.
	* tree-pretty-print.c (dump_vops): Remove printing of V_MUST_DEF. Remove kill_p variable.
	* tree-dfa.c (struct dfa_stats_d): Remove num_v_must_defs.
	(dump_dfa_stats): Remove code related to V_MUST_DEFs.
	(collect_dfa_stats_r): Do not set num_v_must_defs.
	(mark_new_vars_to_rename): Remove v_must_defs_{before,after} code.
	* tree-into-ssa.c (mark_def_sites): Change SSA_OP_VMUSTKILL to SSA_OP_VMAYUSE.
	* tree-ssa-pre.c (compute_rvuse_and_antic_safe): Remove SSA_OP_VMUSTDEF and SSA_OP_VMUSTKILL.
	* tree-ssa-propagate.c (stmt_makes_single_store): Remove SSA_OP_VMUSTDEF.

From-SVN: r119760
414 lines
15 KiB
C
/* Loop Vectorization
   Copyright (C) 2003, 2004, 2005, 2006 Free Software Foundation, Inc.
   Contributed by Dorit Naishlos <dorit@il.ibm.com>

This file is part of GCC.

GCC is free software; you can redistribute it and/or modify it under
the terms of the GNU General Public License as published by the Free
Software Foundation; either version 2, or (at your option) any later
version.

GCC is distributed in the hope that it will be useful, but WITHOUT ANY
WARRANTY; without even the implied warranty of MERCHANTABILITY or
FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License
for more details.

You should have received a copy of the GNU General Public License
along with GCC; see the file COPYING. If not, write to the Free
Software Foundation, 51 Franklin Street, Fifth Floor, Boston, MA
02110-1301, USA. */

#ifndef GCC_TREE_VECTORIZER_H
#define GCC_TREE_VECTORIZER_H

#ifdef USE_MAPPED_LOCATION
typedef source_location LOC;
#define UNKNOWN_LOC UNKNOWN_LOCATION
#define EXPR_LOC(e) EXPR_LOCATION(e)
#define LOC_FILE(l) LOCATION_FILE (l)
#define LOC_LINE(l) LOCATION_LINE (l)
#else
typedef source_locus LOC;
#define UNKNOWN_LOC NULL
#define EXPR_LOC(e) EXPR_LOCUS(e)
#define LOC_FILE(l) (l)->file
#define LOC_LINE(l) (l)->line
#endif

/* Used for naming of new temporaries. */
enum vect_var_kind {
  vect_simple_var,
  vect_pointer_var,
  vect_scalar_var
};

/* Defines type of operation. */
enum operation_type {
  unary_op = 1,
  binary_op,
  ternary_op
};

/* Define type of available alignment support. */
enum dr_alignment_support {
  dr_unaligned_unsupported,
  dr_unaligned_supported,
  dr_unaligned_software_pipeline,
  dr_aligned
};

/* Define type of def-use cross-iteration cycle. */
enum vect_def_type {
  vect_constant_def,
  vect_invariant_def,
  vect_loop_def,
  vect_induction_def,
  vect_reduction_def,
  vect_unknown_def_type
};

/* Define verbosity levels. */
enum verbosity_levels {
  REPORT_NONE,
  REPORT_VECTORIZED_LOOPS,
  REPORT_UNVECTORIZED_LOOPS,
  REPORT_ALIGNMENT,
  REPORT_DR_DETAILS,
  REPORT_BAD_FORM_LOOPS,
  REPORT_OUTER_LOOPS,
  REPORT_DETAILS,
  /* New verbosity levels should be added before this one. */
  MAX_VERBOSITY_LEVEL
};

/*-----------------------------------------------------------------*/
/* Info on vectorized loops. */
/*-----------------------------------------------------------------*/
typedef struct _loop_vec_info {

  /* The loop to which this info struct refers. */
  struct loop *loop;

  /* The loop basic blocks. */
  basic_block *bbs;

  /* The loop exit_condition. */
  tree exit_cond;

  /* Number of iterations. */
  tree num_iters;

  /* Is the loop vectorizable? */
  bool vectorizable;

  /* Unrolling factor */
  int vectorization_factor;

  /* Unknown DRs according to which loop was peeled. */
  struct data_reference *unaligned_dr;

  /* peeling_for_alignment indicates whether peeling for alignment will take
     place, and what the peeling factor should be:
     peeling_for_alignment = X means:
        If X=0: Peeling for alignment will not be applied.
        If X>0: Peel first X iterations.
        If X=-1: Generate a runtime test to calculate the number of iterations
                 to be peeled, using the dataref recorded in the field
                 unaligned_dr. */
  int peeling_for_alignment;

  /* The mask used to check the alignment of pointers or arrays. */
  int ptr_mask;

  /* All data references in the loop. */
  VEC (data_reference_p, heap) *datarefs;

  /* All data dependences in the loop. */
  VEC (ddr_p, heap) *ddrs;

  /* Statements in the loop that have data references that are candidates for a
     runtime (loop versioning) misalignment check. */
  VEC(tree,heap) *may_misalign_stmts;

  /* The loop location in the source. */
  LOC loop_line_number;
} *loop_vec_info;

/* Access Functions. */
#define LOOP_VINFO_LOOP(L) (L)->loop
#define LOOP_VINFO_BBS(L) (L)->bbs
#define LOOP_VINFO_EXIT_COND(L) (L)->exit_cond
#define LOOP_VINFO_NITERS(L) (L)->num_iters
#define LOOP_VINFO_VECTORIZABLE_P(L) (L)->vectorizable
#define LOOP_VINFO_VECT_FACTOR(L) (L)->vectorization_factor
#define LOOP_VINFO_PTR_MASK(L) (L)->ptr_mask
#define LOOP_VINFO_DATAREFS(L) (L)->datarefs
#define LOOP_VINFO_DDRS(L) (L)->ddrs
#define LOOP_VINFO_INT_NITERS(L) (TREE_INT_CST_LOW ((L)->num_iters))
#define LOOP_PEELING_FOR_ALIGNMENT(L) (L)->peeling_for_alignment
#define LOOP_VINFO_UNALIGNED_DR(L) (L)->unaligned_dr
#define LOOP_VINFO_MAY_MISALIGN_STMTS(L) (L)->may_misalign_stmts
#define LOOP_VINFO_LOC(L) (L)->loop_line_number

#define LOOP_VINFO_NITERS_KNOWN_P(L) \
(host_integerp ((L)->num_iters,0) \
&& TREE_INT_CST_LOW ((L)->num_iters) > 0)

/*-----------------------------------------------------------------*/
/* Info on vectorized defs. */
/*-----------------------------------------------------------------*/
enum stmt_vec_info_type {
  undef_vec_info_type = 0,
  load_vec_info_type,
  store_vec_info_type,
  op_vec_info_type,
  call_vec_info_type,
  assignment_vec_info_type,
  condition_vec_info_type,
  reduc_vec_info_type,
  type_promotion_vec_info_type,
  type_demotion_vec_info_type
};

/* Indicates whether/how a variable is used in the loop. */
enum vect_relevant {
  vect_unused_in_loop = 0,
  vect_used_by_reduction,
  vect_used_in_loop
};

typedef struct data_reference *dr_p;
DEF_VEC_P(dr_p);
DEF_VEC_ALLOC_P(dr_p,heap);

typedef struct _stmt_vec_info {

  enum stmt_vec_info_type type;

  /* The stmt to which this info struct refers. */
  tree stmt;

  /* The loop_vec_info with respect to which STMT is vectorized. */
  loop_vec_info loop_vinfo;

  /* Not all stmts in the loop need to be vectorized, e.g., the increment
     of the loop induction variable and computation of array indexes. relevant
     indicates whether the stmt needs to be vectorized. */
  enum vect_relevant relevant;

  /* Indicates whether this stmt is part of a computation whose result is
     used outside the loop. */
  bool live;

  /* The vector type to be used. */
  tree vectype;

  /* The vectorized version of the stmt. */
  tree vectorized_stmt;


  /** The following is relevant only for stmts that contain a non-scalar
     data-ref (array/pointer/struct access). A GIMPLE stmt is expected to have
     at most one such data-ref. **/

  /* Information about the data-ref (access function, etc). */
  struct data_reference *data_ref_info;

  /* Stmt is part of some pattern (computation idiom) */
  bool in_pattern_p;

  /* Used for various bookkeeping purposes, generally holding a pointer to
     some other stmt S that is in some way "related" to this stmt.
     Current use of this field is:
        If this stmt is part of a pattern (i.e. the field 'in_pattern_p' is
        true): S is the "pattern stmt" that represents (and replaces) the
        sequence of stmts that constitutes the pattern. Similarly, the
        related_stmt of the "pattern stmt" points back to this stmt (which is
        the last stmt in the original sequence of stmts that constitutes the
        pattern). */
  tree related_stmt;

  /* List of datarefs that are known to have the same alignment as the dataref
     of this stmt. */
  VEC(dr_p,heap) *same_align_refs;

  /* Classify the def of this stmt. */
  enum vect_def_type def_type;

  /* Interleaving info. */
  /* First data-ref in the interleaving group. */
  tree first_dr;
  /* Pointer to the next data-ref in the group. */
  tree next_dr;
  /* The size of the interleaving group. */
  unsigned int size;
  /* For stores, number of stores from this group seen. We vectorize the last
     one. */
  unsigned int store_count;
  /* For loads only, the gap from the previous load. For consecutive loads, GAP
     is 1. */
  unsigned int gap;
  /* In case that two or more stmts share data-ref, this is the pointer to the
     previously detected stmt with the same dr. */
  tree same_dr_stmt;
} *stmt_vec_info;

/* Access Functions. */
#define STMT_VINFO_TYPE(S) (S)->type
#define STMT_VINFO_STMT(S) (S)->stmt
#define STMT_VINFO_LOOP_VINFO(S) (S)->loop_vinfo
#define STMT_VINFO_RELEVANT(S) (S)->relevant
#define STMT_VINFO_LIVE_P(S) (S)->live
#define STMT_VINFO_VECTYPE(S) (S)->vectype
#define STMT_VINFO_VEC_STMT(S) (S)->vectorized_stmt
#define STMT_VINFO_DATA_REF(S) (S)->data_ref_info
#define STMT_VINFO_IN_PATTERN_P(S) (S)->in_pattern_p
#define STMT_VINFO_RELATED_STMT(S) (S)->related_stmt
#define STMT_VINFO_SAME_ALIGN_REFS(S) (S)->same_align_refs
#define STMT_VINFO_DEF_TYPE(S) (S)->def_type
#define STMT_VINFO_DR_GROUP_FIRST_DR(S) (S)->first_dr
#define STMT_VINFO_DR_GROUP_NEXT_DR(S) (S)->next_dr
#define STMT_VINFO_DR_GROUP_SIZE(S) (S)->size
#define STMT_VINFO_DR_GROUP_STORE_COUNT(S) (S)->store_count
#define STMT_VINFO_DR_GROUP_GAP(S) (S)->gap
#define STMT_VINFO_DR_GROUP_SAME_DR_STMT(S) (S)->same_dr_stmt

#define DR_GROUP_FIRST_DR(S) (S)->first_dr
#define DR_GROUP_NEXT_DR(S) (S)->next_dr
#define DR_GROUP_SIZE(S) (S)->size
#define DR_GROUP_STORE_COUNT(S) (S)->store_count
#define DR_GROUP_GAP(S) (S)->gap
#define DR_GROUP_SAME_DR_STMT(S) (S)->same_dr_stmt

#define STMT_VINFO_RELEVANT_P(S) ((S)->relevant != vect_unused_in_loop)

static inline void set_stmt_info (stmt_ann_t ann, stmt_vec_info stmt_info);
static inline stmt_vec_info vinfo_for_stmt (tree stmt);

static inline void
set_stmt_info (stmt_ann_t ann, stmt_vec_info stmt_info)
{
  if (ann)
    ann->common.aux = (char *) stmt_info;
}

static inline stmt_vec_info
vinfo_for_stmt (tree stmt)
{
  stmt_ann_t ann = stmt_ann (stmt);
  return ann ? (stmt_vec_info) ann->common.aux : NULL;
}

/*-----------------------------------------------------------------*/
/* Info on data references alignment. */
/*-----------------------------------------------------------------*/

/* Reflects actual alignment of first access in the vectorized loop,
   taking into account peeling/versioning if applied. */
#define DR_MISALIGNMENT(DR) (DR)->aux

static inline bool
aligned_access_p (struct data_reference *data_ref_info)
{
  return (DR_MISALIGNMENT (data_ref_info) == 0);
}

static inline bool
known_alignment_for_access_p (struct data_reference *data_ref_info)
{
  return (DR_MISALIGNMENT (data_ref_info) != -1);
}

/* Perform signed modulo, always returning a non-negative value. */
#define VECT_SMODULO(x,y) ((x) % (y) < 0 ? ((x) % (y) + (y)) : (x) % (y))

/* vect_dump will be set to stderr or dump_file if it exists. */
extern FILE *vect_dump;
extern enum verbosity_levels vect_verbosity_level;

/* Bitmap of virtual variables to be renamed. */
extern bitmap vect_memsyms_to_rename;

/*-----------------------------------------------------------------*/
/* Function prototypes. */
/*-----------------------------------------------------------------*/

/*************************************************************************
  Simple Loop Peeling Utilities - in tree-vectorizer.c
 *************************************************************************/
/* Entry point for peeling of simple loops.
   Peel the first/last iterations of a loop.
   It can be used outside of the vectorizer for loops that are simple enough
   (see function documentation). In the vectorizer it is used to peel the
   last few iterations when the loop bound is unknown or does not evenly
   divide by the vectorization factor, and to peel the first few iterations
   to force the alignment of data references in the loop. */
extern struct loop *slpeel_tree_peel_loop_to_edge
  (struct loop *, edge, tree, tree, bool);
extern void slpeel_make_loop_iterate_ntimes (struct loop *, tree);
extern bool slpeel_can_duplicate_loop_p (struct loop *, edge);
#ifdef ENABLE_CHECKING
extern void slpeel_verify_cfg_after_peeling (struct loop *, struct loop *);
#endif


/*************************************************************************
  General Vectorization Utilities
 *************************************************************************/
/** In tree-vectorizer.c **/
extern tree get_vectype_for_scalar_type (tree);
extern bool vect_is_simple_use (tree, loop_vec_info, tree *, tree *,
                                enum vect_def_type *);
extern bool vect_is_simple_iv_evolution (unsigned, tree, tree *, tree *);
extern tree vect_is_simple_reduction (struct loop *, tree);
extern bool vect_can_force_dr_alignment_p (tree, unsigned int);
extern enum dr_alignment_support vect_supportable_dr_alignment
  (struct data_reference *);
extern bool reduction_code_for_scalar_code (enum tree_code, enum tree_code *);
extern bool supportable_widening_operation (enum tree_code, tree, tree,
  tree *, tree *, enum tree_code *, enum tree_code *);
/* Creation and deletion of loop and stmt info structs. */
extern loop_vec_info new_loop_vec_info (struct loop *loop);
extern void destroy_loop_vec_info (loop_vec_info);
extern stmt_vec_info new_stmt_vec_info (tree stmt, loop_vec_info);


/** In tree-vect-analyze.c **/
/* Driver for analysis stage. */
extern loop_vec_info vect_analyze_loop (struct loop *);


/** In tree-vect-patterns.c **/
/* Pattern recognition functions.
   Additional pattern recognition functions can (and will) be added
   in the future. */
typedef tree (* vect_recog_func_ptr) (tree, tree *, tree *);
#define NUM_PATTERNS 4
void vect_pattern_recog (loop_vec_info);


/** In tree-vect-transform.c **/
extern bool vectorizable_load (tree, block_stmt_iterator *, tree *);
extern bool vectorizable_store (tree, block_stmt_iterator *, tree *);
extern bool vectorizable_operation (tree, block_stmt_iterator *, tree *);
extern bool vectorizable_type_promotion (tree, block_stmt_iterator *, tree *);
extern bool vectorizable_type_demotion (tree, block_stmt_iterator *, tree *);
extern bool vectorizable_assignment (tree, block_stmt_iterator *, tree *);
extern bool vectorizable_function (tree, tree);
extern bool vectorizable_call (tree, block_stmt_iterator *, tree *);
extern bool vectorizable_condition (tree, block_stmt_iterator *, tree *);
extern bool vectorizable_live_operation (tree, block_stmt_iterator *, tree *);
extern bool vectorizable_reduction (tree, block_stmt_iterator *, tree *);
/* Driver for transformation stage. */
extern void vect_transform_loop (loop_vec_info);

/*************************************************************************
  Vectorization Debug Information - in tree-vectorizer.c
 *************************************************************************/
extern bool vect_print_dump_info (enum verbosity_levels);
extern void vect_set_verbosity_level (const char *);
extern LOC find_loop_location (struct loop *);

#endif /* GCC_TREE_VECTORIZER_H */
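
The VECT_SMODULO macro in the header above is described as a signed modulo that always returns a non-negative value. The following is a minimal standalone sketch of that behavior, not code from GCC: the macro is copied from the header, while the wrapper `smodulo_example` and its requirement that the divisor be positive are assumptions made for illustration only.

```c
#include <assert.h>

/* Copied from tree-vectorizer.h. */
#define VECT_SMODULO(x,y) ((x) % (y) < 0 ? ((x) % (y) + (y)) : (x) % (y))

/* Hypothetical wrapper, not part of GCC.  It assumes a positive divisor,
   which is the case in which the macro's result lies in [0, y).  */
static int
smodulo_example (int x, int y)
{
  assert (y > 0);
  return VECT_SMODULO (x, y);
}
```

Under C99 rules the built-in `%` truncates toward zero, so `-3 % 16` evaluates to -3, whereas `smodulo_example (-3, 16)` evaluates to 13. Note that the macro evaluates its arguments more than once, so arguments with side effects should be avoided.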