Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Towards Handling Big LHS #361

Open
wants to merge 2 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
14 changes: 10 additions & 4 deletions lib/Infer/ExhaustiveSynthesis.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -32,6 +32,9 @@ namespace {
"The larger the number is, the more fine-grained debug "
"information will be printed"),
cl::init(0));
static cl::opt<bool> NoBigQuery("souper-exhaustive-synthesis-no-big-query",
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

let's get some data about this also -- maybe run synthesis with and without it and see how it changes the CPU time used

cl::desc("Disable big query in exhaustive synthesis (default=false)"),
cl::init(false));
}

// TODO
Expand Down Expand Up @@ -374,8 +377,10 @@ ExhaustiveSynthesis::synthesize(SMTLIBSolver *SMTSolver,
const std::vector<InstMapping> &PCs,
Inst *LHS, Inst *&RHS,
InstContext &IC, unsigned Timeout) {

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we should talk about this part. why do you think it's a good idea? do you have data supporting that it does a better job?

Copy link
Contributor Author

@zhengyang92 zhengyang92 Oct 3, 2018

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I did this because findCands() stops when it reaches MaxLHSCands(which is set to 15 by default). However, this might miss the input variable declared at the top of program. Due to this limitation, the synthesizer cannot gives correct ctpop result since the input %x is not harvested as a synthesis component.

Exhaustive synthesis has a call findVars() after doing the big query, hoisting up this call does not have much impact to the compilation performance, compared with the potential benefit of getting better synthesis results.

This approach is stupid I admit, it basically runs the traversal on the LHS twice. I can write a better version of the findCands() which runs once.

Copy link
Contributor Author

@zhengyang92 zhengyang92 Oct 3, 2018

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This code basically guarantees all the program inputs are collected as components in the synthesis.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I agree that tuning MaxLHSCands will be useful, but just harvesting the vars doesn't sound like the right solution.
for one thing, what if there are 500 vars? then synthesis of just one select instruction is going to involve evaluating 500x500x500 guesses

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the idea behind the hard bound is that it prevents these degenerate cases, but your patch will just open us up to them again

Copy link
Contributor Author

@zhengyang92 zhengyang92 Oct 3, 2018

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Okay this makes sense. But instead of stopping at a hard bounded limit when doing findCands(), can we traverse the whole tree and hard bound the result by the descending order by the "benefit" of each candidates? There is already a "benefit" tagging algorithm there implemented in findCands().

Copy link
Contributor Author

@zhengyang92 zhengyang92 Oct 3, 2018

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Current "benefit" tagging algorithm simply follows: the deeper the traverse goes, the higher benefit the candidate gets. We may further customize this benefit function to fit our needs. Say some algorithm can tell some kind of candidates are more likely to be used as a component in synthesis, then we may increase the "benefit" of this kind of candidates, to reduce the synthesis time.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

well, the benefit is a property of the RHS cost vs. LHS cost, not a property of the choice of inputs, which is what we're talking about here.

my observation is that most rewrites are going to occur at the bottom of a souper LHS, which is why I wrote the code to get inputs using DFS on the LHS.

if we pick the ones likely to lead to the highest benefit, you're saying to pick inputs from the top of the LHS. I'm not opposed to this, but I think it is a mistake to make the change before we have data showing that the change is a good one.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

anyway I'm saying:

  • this is really two patches, please split them
  • for each patch, I want you to justify it using data instead of making guesses about what will work best

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sure I will split this pr and collect data

std::vector<Inst *> Inputs;
findCands(LHS, Inputs, /*WidthMustMatch=*/false, /*FilterVars=*/false, MaxLHSCands);

if (DebugLevel > 1)
llvm::errs() << "got " << Inputs.size() << " candidates from LHS\n";

Expand All @@ -397,7 +402,7 @@ ExhaustiveSynthesis::synthesize(SMTLIBSolver *SMTSolver,

// Big Query
// TODO: Need to check if big query actually saves us time or just wastes time
{
if (!NoBigQuery) {
Inst *Ante = IC.getConst(APInt(1, true));
BlockPCs BPCsCopy;
std::vector<InstMapping> PCsCopy;
Expand Down Expand Up @@ -445,6 +450,10 @@ ExhaustiveSynthesis::synthesize(SMTLIBSolver *SMTSolver,
// find the valid one
int Unsat = 0;
int GuessIndex = -1;

std::vector<Inst *> Vars;
findVars(LHS, Vars);

for (auto I : Guesses) {
GuessIndex++;
if (DebugLevel > 2) {
Expand Down Expand Up @@ -478,9 +487,6 @@ ExhaustiveSynthesis::synthesize(SMTLIBSolver *SMTSolver,
}
}

std::vector<Inst *> Vars;
findVars(LHS, Vars);

std::map<Inst *, llvm::APInt> ConstMap;

{
Expand Down
24 changes: 18 additions & 6 deletions lib/Inst/Inst.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -783,8 +783,16 @@ void souper::findCands(Inst *Root, std::vector<Inst *> &Guesses,
bool WidthMustMatch, bool FilterVars, int Max) {
// breadth-first search
std::set<Inst *> Visited;
std::queue<std::tuple<Inst *,int>> Q;
Q.push(std::make_tuple(Root, 0));
std::queue<std::pair<Inst *,int>> Q;
auto Comp = [](const std::pair<Inst *,int> A, const std::pair<Inst *,int> B)
{
return A.second < B.second;
};
std::priority_queue<std::pair<Inst *, int>,
std::vector<std::pair<Inst *,int>>,
decltype(Comp)> GuessesPQ(Comp);

Q.push(std::make_pair(Root, 0));
while (!Q.empty()) {
Inst *I;
int Benefit;
Expand All @@ -794,20 +802,24 @@ void souper::findCands(Inst *Root, std::vector<Inst *> &Guesses,
if (Visited.insert(I).second) {
if (I->K != Inst::Phi) {
for (auto Op : I->Ops)
Q.push(std::make_tuple(Op, Benefit));
Q.push(std::make_pair(Op, Benefit));
}
if (Benefit > 1 && I->Available && I->K != Inst::Const
&& I->K != Inst::UntypedConst) {
if (WidthMustMatch && I->Width != Root->Width)
continue;
if (FilterVars && I->K == Inst::Var)
continue;
Guesses.emplace_back(I);
if (Guesses.size() >= Max)
return;
GuessesPQ.push(std::make_pair(I, Benefit));
}
}
}
unsigned Size = GuessesPQ.size();
for (unsigned T = 0 ; T < Max && T < Size ; T++) {
Inst *I = GuessesPQ.top().first;
GuessesPQ.pop();
Guesses.emplace_back(I);
}
}

Inst *souper::getInstCopy(Inst *I, InstContext &IC,
Expand Down
2 changes: 2 additions & 0 deletions test/Infer/popcount6-syn.opt
Original file line number Diff line number Diff line change
@@ -1,7 +1,9 @@
; REQUIRES: solver, solver-model

; RUN: %souper-check %solver -infer-rhs -souper-infer-inst -souper-synthesis-comps=ctpop %s > %t1
; RUN: %souper-check %solver -infer-rhs -souper-exhaustive-synthesis -souper-exhaustive-synthesis-no-big-query %s > %t2
; RUN: %FileCheck %s < %t1
; RUN: %FileCheck %s < %t2

; CHECK: %21:i32 = ctpop %0
; CHECK-NEXT: result %21
Expand Down