Added: Jan. 10, 2013, 11:57 p.m.
FOV: 96.225° (larger dimension)
Focal length: 0.448 × height
Scene: staircase
Scene correct: 
		
		True
		
	
Whitebalanced: 
		
		True
		
	
Scene label correct votes:
Scene label correct score:
Whitebalance votes:
Whitebalance score:
Flickr user: mwichary
Flickr ID: 6015956174
License: Attribution 2.0 Generic
	
	(Credit: Marcin Wichary)
Material segmentations
Users were asked to draw around regions of a single type of material.
No items are available.
Vanishing points
Each color corresponds to one vanishing point.  Hover over the points on the right to see the full lines.
- 0: [-0.009763, -0.999356, -0.034523]
- 1: [0.870377, 0.007061, -0.492335]
- 2: [-0.431134, -0.546168, -0.718209]
- 3: [-0.415611, 0.584661, -0.696734]
- 4: [-0.530087, 0.001542, -0.847942]
- 5: [0.454947, -0.076857, -0.887196]
- 6: [0.183191, -0.560284, -0.807789]
- 7: [0.052085, 0.769137, -0.636957]
Whitebalance points
Users were asked to click on points that are white or gray.
Each color corresponds to one user.
Median chroma- 9.252 (17.5 s)
- 10.895 (15.6 s)
- 6.444 (12.7 s)
- 6.310 (7.74 s)
- 4.992 (5.38 s)
- 15.315 (3.05 s)
 Human reflectance judgements
Our user interface for collecting annotations shows the user
				an image and asks them, for a particular pair of pixels
				(indicated with crosshairs and labeled Points 1 and 2), which
				of the two points has a darker surface color. The user can then
				select one of three options: Point 1, Point 2, and About the
				same. We ask users to specify their confidence in their
				assessment as Guessing, Probably, or Definitely, as was done by
				[Branson et al. 2010].
We aggregate judgements from 5 users for each pair of points
				and use the CUBAM machine learning model [Welinder et al. 2010]
				to model two forms of bias.
See our publication for more details.

Our user interface
Intrinsic image decompositions
The input image is decomposed into a "reflectance" and "shading" layer.  Note that the reflectance layer is listed twice: color (left) and grayscale (center).  Decompositions are ordered by error and then runtime (best on top).  The parameters for each algorithm are the same for all photos; they are set to the values that produce lowest mean error (WHDR) for all photos.  See our publication for more details.
- Algorithm: bell2014_densecrf- Parameters:  - abs reflectance weight: 0
- abs shading gray point: 0.5
- abs shading weight: 500.0
- chromaticity weight: 0
- kmeans intensity scale: 0.5
- kmeans n clusters: 20
- n iters: 25
- pairwise intensity chromaticity: True
- pairwise weight: 104
- shading blur init method: none
- shading blur sigma: 0.1
- shading target norm: L2
- shading target weight: 20000.0
- split clusters: True
- theta c: 0.025
- theta l: 0.1
- theta p: 0.1
 
- Result:
			 - Weighted human disagreement rate (WHDR): 29.0% (δ: 0.1)
- WHDR for equal edges only: 0.3366
- WHDR for inequalities only: 0.1803
- Runtime: 318.1 s
 
- Algorithm: garces2012_clustering- Parameters:  - km k: 8
- remap gamma 2 2: False
 
- Result:
			 - Weighted human disagreement rate (WHDR): 29.8% (δ: 0.1)
- WHDR for equal edges only: 0.3516
- WHDR for inequalities only: 0.1704
- Runtime: 6.5 s
 
- Algorithm: baseline_reflectance- Result:
			 - Weighted human disagreement rate (WHDR): 29.9% (δ: 0.1)
- WHDR for equal edges only: 0.0000
- WHDR for inequalities only: 1.0000
- Runtime: 0.1 s
 
- Algorithm: grosse2009_grayscale_retinex- Citation: - Roger Grosse, Micah K. Johnson, Edward H. Adelson, William T. Freeman.  "Ground truth dataset and baseline evaluations for intrinsic image algorithms".   Proceedings of the International Conference on Computer Vision (ICCV)- .   http://www.cs.toronto.edu/~rgrosse/intrinsic/- . 
- Result:
			 - Weighted human disagreement rate (WHDR): 34.5% (δ: 0.1)
- WHDR for equal edges only: 0.4338
- WHDR for inequalities only: 0.1346
- Runtime: 188.0 s
 
- Algorithm: shen2011_optimization- Parameters:  - rho: 1.9
- unmap srgb: False
- wd: 3
 
- Result:
			 - Weighted human disagreement rate (WHDR): 35.0% (δ: 0.1)
- WHDR for equal edges only: 0.3958
- WHDR for inequalities only: 0.2426
- Runtime: 276.6 s
 
- Algorithm: zhao2012_nonlocal- Parameters:  - chrom thresh: 0.001
- gamma: False
- texture patch distance: 0.0003
- texture patch variance: 0.03
 
- Result:
			 - Weighted human disagreement rate (WHDR): 35.0% (δ: 0.1)
- WHDR for equal edges only: 0.4031
- WHDR for inequalities only: 0.2263
- Runtime: 30.8 s
 
- Algorithm: grosse2009_color_retinex- Citation: - Roger Grosse, Micah K. Johnson, Edward H. Adelson, William T. Freeman.  "Ground truth dataset and baseline evaluations for intrinsic image algorithms".   Proceedings of the International Conference on Computer Vision (ICCV)- .   http://www.cs.toronto.edu/~rgrosse/intrinsic/- . 
- Parameters:  - L1: True
- threshold color: 0.7
- threshold gray: 0.5
 
- Result:
			 - Weighted human disagreement rate (WHDR): 35.1% (δ: 0.1)
- WHDR for equal edges only: 0.4424
- WHDR for inequalities only: 0.1353
- Runtime: 173.8 s
 
- Algorithm: baseline_shading- Result:
			 - Weighted human disagreement rate (WHDR): 56.6% (δ: 0.1)
- WHDR for equal edges only: 0.7694
- WHDR for inequalities only: 0.0886
- Runtime: 0.1 s