double dummy hand evaluatoraldous/research/... · it is a classic and the most popular method of...
TRANSCRIPT
1
DoubleDummyHandEvaluator
ZhenyangZhangAdvisor:DavidAldous
1.IntroductionThegoalofthisstudyistoanalyzemethodsthatalreadyexistandcomeupwithanewwayofthinkingabouttrick-takingvaluesofdifferenthands.Ratherthanmakinganyconclusionsonhandevaluation,Iwouldliketobemorecautious,andconsidermymodela“baby”versionofadoubledummyhandevaluator.Ihopethatthiswillbringupmorethoughtsabouthandevaluationmodelinginthefuture.IamusingdoubledummydatafromMattGinsberg’sdoubledummylibrary.Onemayarguethatdoubledummytricksarenot“realistic”sincesometimesresultsarenotachievableintherealworldplays.Forexample,doubledummytrickswillalwaysrunasuitmissingQ,whileinrealityplayersmayguesstheQonthewrongsideandloseonetrickasaresult.However,therearestillalotofadvantagesofusingit.First,therearealotofhandsinGinsberg’slibrary,717102handsintotal.Andinsomesensedoubledummytricksaremore“pure”.Wedonothavetotakeintoaccountifplayersmadeamistakeorusedsomeat-tableinformation(likehandgestures,facialexpressions,etc.)duringtheplaysothattheywonmoreorfewertricksthanexpected.Also,expertplayershaveinclinationtobidgames,whichmeansthatminorsuitcontractswillbelessplayedandtheresultwillbesomehowbiased,whiledoubledummyresultswillnotbeaffected.2.ObservationsonexistingmethodsBeforeweactuallystepintoourmodelofevaluatinghands,wewishtoexamineafewexistingmethodsofevaluatinghands.Thefourmethodsarehighcardpoints,distributionalpoints,controlsandlosers.Theyareamongthemostpopularmethodsofevaluatingahandthathavebeenusedbyprofessionalplayers.Butthroughthebelowanalysiswecanseethatthesearenotidealwaysoftellinghowgoodyourhandis.-HighCardPointsItisaclassicandthemostpopularmethodofevaluatingone’shandthateveryonewilllearnatthebeginningofplayingbridge.Aceiscountedas4points,King3,Queen2andJack1.Intotalthereare40pointsandahandofmorethan10pointsisconsideredgood.
2
Hereisahistogramofhighcardpointsof1000randomsampleddeals.Wecanseethatthedistributionisroughlynormalandthusarobustmethod.However,boththeplotandcorrelationbetweenHCPandtricksimplythatthisisnotsoprecise.Andthisisexactlywhatmanybridgeexpertshaveargued:HCPdoesnottakehandshapeintoaccount;QueensandJacksareoverestimatedsincetheyarenotsopowerfulintherealplaysincetheyarelikelytobecoveredbyAcesandKings.
Histogram for hcp
hcp
Frequency
0 5 10 15 20 25
0200
400
600
3
-DistributionpointsArevisedversionofHCPisdistributionpoint:thatistoadddistributionalvaluestooriginalhighcardpointsofahand.Adoubletonisworth1pointmore,singleton3andvoid5.SinceitbasicallyinheritsfromHCP,wecanseethatthedistributionisstillnormal-like.Thecorrelationofdistributionpointandnumberoftrickstowinisaround0.29,whichislargerthanbefore.ThereforeitisbetterthanHCPinsomesensewhilestillnotgoodenough.
0 5 10 15 20
24
68
1012
hcp vs tricks
hcp
tricks
4
Histogram for distp
distp
Frequency
0 5 10 15 20 25 30
0100
200
300
400
500
600
700
5
ControlsSincetherearecontroversialideasonvaluesofQueenandJacks,expertsreevaluatetheirhandsusingcontrolswhentheyarethinkingaboutslamcontracts.AceiscountedastwocontrolsandKingasone.Butfromthehistogramwecanseethatmosthandshave0-2controlsandareconsideredabadhand.BecauseitismorelikeababyversionofHCPnotconsideringQueensandJacks,thecorrelationbetweencontrolsandtrickswonarearound0.08whichimpliesitisnotagoodindicatorof
0 5 10 15 20 25
24
68
1012
distp vs tricks
distp
tricks
6
trickwinningpower.
Histogram for control
control
Frequency
0 2 4 6 8 10
0200
400
600
800
7
-LosersThisisthelastmethodwearegoingtolookat.Losersaresomehowmorecomplicatedtocount.Foreachsuit,justlookatthethreelargestcardsinthesuitarecounthowmanyofAceKingandQueenismissing.Forexample,AQhasoneloser,KQ432hasoneloser,andQ82hastwolosers.Themorelosersonehave,theworsehishandis.Thecorrelationisaround-0.2,adecentone,butstillnotgoodenough.
0 2 4 6 8 10
24
68
1012
controls vs tricks
control
tricks
8
Histogram for losers
loser
Frequency
2 4 6 8 10 12
0200
400
600
800
9
3.HandEvaluatorModelAftershowingthatexistingmethodsarenotidealweaponstouse,wenowwanttocomeupwithsomethingmore.Firstwewanttodecidewhichfactorswearelookingatinthemodel.Generallywehavefouraspectsthatweareconcerningaboutahandwhenwearebidding:notrumpcontractsandtrumpcontracts,anddefendinganddeclaringineachcase.Intheprogram,wearegoingtouseabbreviation,“NTOffense”,“NTDefense”,“TrumpOffense”and“TrumpDefense”torepresentthesefouraspects.Weassumethatthevalueofahandisthelinearsumofthevaluesofeachsuit,andthuswehavethefollowingequation,wheretisthenumberoftricksyouexpecttogetinacertainsuit.
2 4 6 8 10 12
24
68
1012
losers vs tricks
losers
tricks
10
V (h) = t(s)s∈h∑
Weallknowthatalongersuitismorelikelytowinmoretrickssincewehavepotentialchancetorunthesuitandwintherestofthesmallcardsinthesuit.Thuswewishtonormalizeandreducetheeffectofthissuit.Wewishtotaketheexpectedwinningtricksofthesuitinthispattern(thatis,suitsthathavethesamehighcards),minustheexpectedwinningtricksamongallthesuitsofthesamelength.Theequationisstatedbelow:
t(s) = ave(s)− ave(length(s))
Basedonthesetwoequations,weareabletoanalyzetheexpectedtrickswecanwingiven a hand.Wewrote the code in python andwith an input, the hand analysisfunctionwouldprintoutexpectedtrickstowininfouroccasionsrespectively.
Forexample,Ijustrandomlypickedahandfrommyhandhistoryandwishtoknowhowgooditis.ThusIinputthehandinterminalandruntheprogram.
Wecanseefromtheoutputthatweareexpectedtowin7tricksinNTcontracts,and7.8tricksintrumpcontractsifwearedeclarer.Andweareexpectedtowin8.8tricksinNTcontractsand9tricksintrumpcontractsifwearedefending.Thisissuggestingthatourhandisbetterthanaverage,soweshoulddefinitelyopenthebidding.However,itisabetterhandindefense,andhasalotofpotentialindefendingopponents’contract.Sowearealsohappytodefendiftheopponentsstarttointerfereinthebidding.
11
4.StrengthandWeaknessoftheModel:Thismodelusesthewholesetofdataoutofover700000hands,andthusthenumericalanalysisisindeedreliable.Fromtheaboveanalysisweareabletomakedecisionsinbiddingmoreconfidently.However,wehavenotconsideredpartner’shandinthemodel,whichmeansthatweeliminatehalfoftheinformation.Furthermore,duringthebidding,asweknowmoreandmoreaboutpartner’shand,weshouldbeabletodynamicallyconditionontheknownfacttoreevaluateourhand’svalue,butunfortunatelywehavenotfoundgoodwaystogetthatinvolved.Finally,inthemodel,weassumethelinearityofhonorsindifferentsuitssothatwecansumthemtogethertogetthefinalresult.However,intherealworld,atrumpsuitwillplayalargerrolethanothersuit,butwecannotdecidethetrumpsuitjustlookingatourhandsincepartner’shandisunclear.Alsowejustignorethecomplexcorrelationbetweenhonorsinasuitandthusitmakesthemodelinaccurate.Inconclusion,Iwillrecommendthismodelmoreasareferenceforreconsideringyourhandvaluethananaccuratetrickwinningpowerreportofasinglehand.5.ReferenceMattGinsberg’spaperonLawoftotaltricks:http://bocosan.tripod.com/ginsberg/total.HTMLExplanationsofbinaryfileofdoubledummylibrary:http://bocosan.tripod.com/ginsberg/library_notes.HTML