D is the number of basis functions used for the approximation (it is a parameter that may be estimated or fixed); G(g)=g2*log(g) (other functions are possible); rd is the prototypical location value for the dth basis function; and td is a parameter value that is chosen to minimize the sum of squared errors below over a large ???training set??? of content items ???