capture_xam.cpp¶

A Capture Example and Speed Test¶

Syntax¶

build/speed/capture_xam \
      random_seed  \
      number_random   \
      quasi_fixed  \
      trace_optimize_fixed  \
      ipopt_solve  \
      bool_sparsity  \
      hold_memory  \
      derivative_test  \
      start_near_solution  \
      number_fixed_samples  \
      number_locations  \
      max_population  \
      mean_population  \
      mean_logit_probability  \
      std_logit_probability  \
      random_constraint

Reference¶

J. Andrew Royle, Biometrics 60, 108-115 March 2004, N-Mixture Models for Estimating Population Size from Spatially Replicated Counts.

Command Arguments¶

random_seed¶

This is a non-negative integer equal to the seed for the random number generator, to be specific, s_in used during the call to new_gsl_rng .

number_random¶

This is a positive integer equal to the number of random effects. This is also equal to the number of times at which the measurements are made and is denoted by \(T\) below.

quasi_fixed¶

This is either yes or no and is the value of quasi_fixed in the cppad_mixed derived class constructor. The amount of memory used by the mixed_derived object, after the information matrix is computed, will be similar to after the initialization when quasi_fixed is no.

trace_optimize_fixed¶

This is either yes or no . If it is yes, a print_level = 5 trace of the fixed effects optimization is included in the program output. Otherwise the ipopt print_level is zero and no such trace is printed.

ipopt_solve¶

This is either yes or no . If it is yes, the CppAD::ipopt::solve routine is used for optimizing the random effects, otherwise CppAD::mixed::ipopt_random is used; see evaluation_method .

bool_sparsity¶

This is either yes or no . If it is yes, boolean sparsity patterns are used for this computation, otherwise set sparsity patterns are used.

hold_memory¶

The CppAD memory allocator has a hold memory option will be set by

CppAD::thread_alloc::hold_memory ( hold_memory );

where hold_memory is either yes or no .

derivative_test¶

This is either yes or no . If it is yes, the derivatives of functions used in the optimization of the fixed effects are checked for correctness. (This requires extra time).

start_near_solution¶

This is either yes or no . If it is yes, the initial point for the optimization is the value of the fixed effects used to simulate the data. Otherwise, the initial point is significantly different from this value.

number_fixed_samples¶

This is a positive integer equal to the number of samples simulated from the posterior distribution for the fixed effects using sample_fixed . The samples are used to approximation the standard deviation for the optimal fixed effects. One should use a large number of samples (at least 100) for this purpose.

number_locations¶

This is a positive integer equal to the number of locations at which the measurements are made; i.e. \(R\) in the reference. Increasing this value increases the amount for each function evaluation, but does not change the number of fixed or random effects.

max_population¶

This is a positive integer equal to the maximum value in the finite summation with respect to population size; i.e., \(K\) in the reference. This must be greater than any of the simulated number of captures at any location and time; i.e., and \(y_{i,t}\). Also note that max_population does not affect the simulated data \(y_{i,t}\). A suggested value is five times mean_population . The value of max_population is large enough if increasing it takes more time but does not make a difference in the optimal fixed effects (use the same value for the other arguments and same actual seed).

mean_population¶

This is a positive floating point value equal to the mean of the Poisson distribution for the population used to simulate data values; \(\lambda\) in reference.

mean_logit_probability¶

This is a positive floating point value equal to the mean of the logit of the capture probability (independent of the random effects) used to simulate data values. The is also equal to the mean of the random effects.

std_logit_probability¶

This is a positive floating point value equal to the standard deviation of the logit of the capture probability (independent of the random effects) used to simulate data values. The is also equal to the standard deviation of the random effects.

random_constraint¶

This is either yes or no . If it is no , there is no random constraint for this example. If it is yes , the random constraint is

\[0 = \hat{u}_0 ( \theta ) + \cdots + \hat{u}_{T-1} ( \theta )\]

where \(\hat{u} ( \theta )\) is the optimal random effects . The corresponding random constraint matrix \(A\) is the row vector of size \(T\) with all ones; i.e.,

\[A = [ 1 , \cdots , 1 ] \in \B{R}^{1 \times T}\]

Output¶

Each output name, value pair is written in as name = value where the amount of spaces surrounding the equal sign is not specified. All of the pairs listed above are output. In addition, the following name value pairs are also output.

cppad_mixed_version¶

The cppad_mixed version number.

ldlt_cholmod¶

is the bin/run_cmake.sh configuration option ldlt_cholmod .

optimize_cppad_function¶

is the bin/run_cmake.sh configuration option optimize_cppad_function .

ndebug_defined¶

is the NDEBUG preprocessor symbol defined. This should be yes (no) if the bin/run_cmake.sh configuration option build_type is release (debug ).

actual_seed¶

If random_seed is zero, the system clock, instead of random_seed , is used to seed the random number generator. The actual random seed actual_seed is printed so that you can reproduce results when random_seed is zero.

initialize_bytes¶

Is the amount of heap memory, in bytes, added to the program during its initialize call. Note that more temporary memory may have been used during this call. In addition, only memory allocated using CppAD::thread_alloc is included.

initialize_seconds¶

Is the number of seconds used by the derived class initialize call.

optimize_fixed_seconds¶

Is the number of seconds used by the call to optimize_fixed that is used to compute the optimal fixed effects.

optimize_random_seconds¶

Is the number of seconds used by a single call to optimize_random that is used to compute the optimal random effects.

information_mat_seconds¶

Is the number of seconds used by the call to information_mat that computes the observed information matrix.

sample_fixed_seconds¶

Is the number of seconds used by the call to sample_fixed that computes the number_sample_fixed samples for the fixed effects.

final_bytes¶

Is final amount of heap memory, in bytes, added and retained by the program. Only memory allocated using CppAD::thread_alloc is included.

sum_random_effects¶

Is the sum of the optimal random effects.

mean_population_estimate¶

Is the estimate for the mean_population computed by optimize_fixed .

mean_logit_probability_estimate¶

Is the optimal estimate for the mean_logit_probability (computed by optimize_fixed ).

std_logit_probability_estimate¶

Is the optimal estimate for the std_logit_probability .

mean_population_std¶

Is the sample standard deviation of mean_population_estimate (corresponding to the sample computed by sample_fixed ).

mean_logit_probability_std¶

Is the sample standard deviation of mean_logit_probability_estimate .

std_logit_probability_std¶

Is the sample standard deviation of std_logit_probability_estimate .

mean_population_ratio¶

( mean_population_estimate - mean_population ) /

mean_population_std

mean_logit_probability_ratio¶

( mean_logit_probability_estimate - mean_logit_probability ) /

mean_logit_probability_std

std_probability_ratio¶

( std_probability_estimate - std_probability ) /

std_probability_std

capture_xam_ok¶

The following conditions are checked. If they are all true, capture_xam_ok is yes. Otherwise it is no.

sum_random_effects < 1 e-8 || ( random_constraint == no )
mean_population_ratio < 5.0
mean_logit_probability_ratio < 5.0
std_logit_probability_ratio < 5.0

If capture_xam_ok is yes, the program return value is 0 (no error condition). Otherwise it is 1 (error condition).

Example¶

The file capture_xam.sh is an example using this program.

Notation¶

\(R\)	number of sampling locations; i.e., number_locations .
\(T\)	number of sampling times; i.e., number_random .
\(K\)	maximum population in truncation of infinite summation; i.e., max_population
\(\theta_0\)	mean for \(N_i\) given \(\theta\) (\(\lambda\) in reference); i.e., mean_population .
\(\theta_1\)	mean of logit of capture probability; i.e., mean_logit_probability .
\(\theta_2\)	standard deviation of logit of capture probability; i.e., std_logit_probability .
\(N_i\)	size of the population at i-th location
\(y_{i,t}\)	number of captures at location \(i\) and time \(t\) (\(n_{i,t}\) in reference)
\(y_i\)	is the vector of captures at location \(i\) \(( y_{i,0} , \ldots , y_{i, T-1} )\).
\(M_i\)	maximum of captures at i-th location (\(\max_t n_{i,t}\) in reference)
\(q_t\)	capture probability at \(t\) same for all locations (\(p_{i,t}\) in reference)
\(u_t\)	random effect for each sampling time

p(y_it|N_i,q_t)¶

We use a binomial distribution to model the probability of \(y_{i,t}\) given \(N_i\) and \(q_t\); i.e,

\[\begin{split}\B{p} ( y_{i,t} | N_i , q_t ) = \left( \begin{array}{c} N_i \\ y_{i,t} \end{array} \right) q_t^{y(i,t)} \left( 1 - q_t \right)^{y(i,t)}\end{split}\]

Furthermore, we assume that this probability is independent for each \((i, t)\).

p(N_i|theta)¶

We use a Poisson distribution to model the probability of \(N_i\) given \(\theta_0\); i.e.,

\[\B{p} ( N_i | \theta ) = \theta_0^{N(i)} \frac{ \exp[ - \theta_0 ] }{ N_i ! }\]

We assume these this probability is independent for each \(i\).

q_t(theta,u)¶

Section 2.4 of the Reference suggests a covariate model for the probability of capture. We use a similar model defined by

\[\R{logit} ( q_t ) = u_t + \theta_1\]

It follows that

\[q_t( \theta , u) = [ 1 + \exp(- u_t - \theta_1 ) ]^{-1}\]

M_i¶

We define the vector of maximum measurement for each location by

\[M_i = \max \left\{ y_{i,0} , \cdots , y_{i, T-1} \right\}\]

p(y_i|theta,u)¶

The probability for \(y_i\) (the captures at location \(i\)) given \(N_i\), \(\theta\), and \(u\) is

\[\begin{split}\B{p}( y_i | N_i, \theta , u ) = \prod_{t=0}^{T-1} \left( \begin{array}{c} {N(i)} \\ y_{i,t} \end{array} \right) q_t ( \theta , u)^{y(i,t)} \left( 1 - q_t( \theta , u) \right)^{y(i,t)}\end{split}\]

We do not know the population at each location \(N_i\), but instead have a Poisson prior for \(N_i\). We sum with respect to the possible values for \(N_i\) to get the probability of \(y_i\) given \(\theta\) and \(u\).

\[\B{p}( y_i | \theta , u ) = \sum_{k=0}^K \B{p}( y_i | N_i=k, \theta , u ) \B{p}( N_i=k | \theta )\]

where \(k\) is the possible values for \(N_i\). Note that \(K\) should be plus infinity, but we use a fixed finite value for \(K\) as an approximation for the infinite sum.

p(y|theta,u)¶

Our model for the likelihood of the data at all the locations, given the fixed and random effects, is

\[\B{p}( y | \theta , u ) = \prod_{i=0}^{R-1} \B{p}( y_i | \theta , u )\]

Expressed in terms of fixed effects \(\theta\), the random effects \(u\), and the data \(y\), this is

\[\begin{split}\B{p}( y | \theta , u ) = \prod_{i=0}^{R-1} \left[ \sum_{k=0}^{K-1} \theta_0^k \frac{ \exp[ - \theta_0 ] }{ k ! } \prod_{t=0}^{T-1} \left( \begin{array}{c} {k} \\ y_{i,t} \end{array} \right) q_t ( \theta , u)^{y(i,t)} \left( 1 - q_t( \theta , u) \right)^{y(i,t)} \right]\end{split}\]

In cppad_mixed notation, this specifies the random data density .

p(u|theta)¶

We use a normal distribution, with mean zero and standard deviation \(\theta_2\), for the distribution of the random effects \(u\) given the fixed effects \(\theta\); i.e.,

\[\B{p} ( u | \theta ) = \prod_{t=0}^{T-1} \frac{1}{ \theta_2 \sqrt{ 2 \pi } } \exp \left[ - \frac{1}{2} \frac{ u_t^2 }{ \theta_2^2 } \right]\]

In cppad_mixed notation, this specifies the random prior density .

p(theta)¶

For this example there is no fixed prior density \(\B{p}(\theta)\).

p(z|theta)¶

For this example there is no fixed data density \(\B{p}(z | \theta)\).

c(theta)¶

For this example there is no fixed constraint function \(c( \theta )\).

Source Code¶# include <ctime>
# include <gsl/gsl_randist.h>
# include <cppad/utility/vector.hpp>
# include <cppad/utility/elapsed_seconds.hpp>
# include <cppad/mixed/cppad_mixed.hpp>
# include <cppad/mixed/manage_gsl_rng.hpp>
# include <cppad/mixed/configure.hpp>

namespace { // BEGIN_EMPTY_NAMESPACE
//
using std::exp;
using std::log;
//
using CppAD::mixed::s_vector;
using CppAD::mixed::d_vector;
using CppAD::mixed::d_sparse_rcv;
//
// Convert size_t to string adding commas every three digits
std::string size_t2string(size_t value )
{  std::string raw_string = CppAD::to_string(value);
   std::string result = "";
   size_t n_raw = raw_string.size();
   for(size_t i = 0; i < n_raw; i++)
   {  if( i != 0 && (n_raw - i) % 3 == 0 )
         result += ",";
      result += raw_string[i];
   }
   return result;
}

// simulate data, y
void simulate(
   bool                   random_constraint ,
   size_t                 R                 ,
   size_t                 T                 ,
   const d_vector&        theta             ,
   s_vector&              y                 )
{  assert( theta.size() == 3 );
   assert( y.size() == R * T );
   //
   // random number generator
   gsl_rng* rng = CppAD::mixed::get_gsl_rng();
   //
   // simulate population sizes
   d_vector N(R);
   double mu =  theta[0];
   for(size_t i = 0; i < R; i++)
      N[i] = gsl_ran_poisson(rng, mu );
   //
   // simulate random effects
   d_vector u(T);
   double sigma = theta[2];
   double sum   = 0.0;
   for(size_t t = 0; t < T; t++)
   {  u[t] = gsl_ran_gaussian(rng, sigma);
      sum += u[t];
   }
   // adjust the random effects when using random constraint
   if( random_constraint )
   {  for(size_t t = 0; t < T; t++)
      {  // simulate mean zero values
         u[t] = u[t] - sum / double(T);
         //
         // correct for loss of one degree of freedom
         u[t] = u[t] * sqrt( double(T) / double(T-1) );
      }
   }
   // simulate data
   for(size_t i = 0; i < R; i++)
   {  for(size_t t = 0; t < T; t++)
      {  // probability of capture
         double ex = exp( - u[t] - theta[1] );
         double q = 1.0 /( 1.0  + ex );
         y[ i * T + t ] = gsl_ran_binomial(rng, q, (unsigned int)N[i] );
      }
   }
   //
   return;
}

// cppad_mixed derived class
class mixed_derived : public cppad_mixed {
private:
   const size_t          R_; // number of locations
   const size_t          T_; // number of times
   size_t                K_; // max used when summing over population size
   const s_vector&       y_; // reference to data values
   // -----------------------------------------------------------------
   // set by constructor and then effectively const
   s_vector        M_;       // max number of captures at each location
   d_vector        logfac_;  // logfac_[k] = log( k! )
   d_vector        log_pik_; // used by implement_ran_likelihood
// ------------------------------------------------------------------------
public:
   // constructor
   mixed_derived(
      size_t                  R             ,
      size_t                  T             ,
      size_t                  K             ,
      bool                    quasi_fixed   ,
      bool                    bool_sparsity ,
      const d_sparse_rcv&     A_rcv         ,
      s_vector&               y             ,
      d_vector&               fixed_in      ,
      d_vector&               random_in     ) :
      // n_fixed = 3, n_random = T
      cppad_mixed(
         3, T, quasi_fixed, bool_sparsity, A_rcv
      )                ,
      R_(R)            ,
      T_(T)            ,
      K_(K)            ,
      y_(y)
   {  assert( fixed_in.size() == 3 );
      assert( random_in.size() == T );
      //
      // set M_
      M_.resize(R);
      for(size_t i = 0; i < R; i++)
      {  M_[i] = 0;
         for(size_t t = 0; t < T; t++)
            M_[i] = std::max( M_[i], y[ i * T + t] );
         assert( M_[i] < K_ );
      }
      //
      // set logfac_
      logfac_.resize(K_);
      logfac_[0]  = 0.0;
      logfac_[1]  = 0.0;
      for(size_t k = 2; k < K_; k++)
         logfac_[k] = log( double(k) ) + logfac_[k-1];
      //
      // set log_pik_
      log_pik_.resize(R_ * K_);
      for(size_t ell = 0; ell < R_ * K_; ell++)
         log_pik_[ell] = 0.0;
      template_ran_likelihood(fixed_in, random_in, log_pik_);
   }
   // implementation of ran_likelihood, used with Float = double and a1_double
   template <class Float>
   CppAD::vector<Float> template_ran_likelihood(
      const CppAD::vector<Float>&  theta   ,
      const CppAD::vector<Float>&  u       ,
      CppAD::vector<Float>&        log_pik )
   {  using CppAD::vector;
      //
      assert( log_pik.size() == R_* K_ );
      vector<Float> vec(1);
      //
      Float one( 1.0 );
      Float two( 2.0 );
      Float pi2( 8.0 * std::atan(1.0) );
      Float eps = Float( 10.0 * std::numeric_limits<double>::epsilon() );
      //  ------------------------------------------------------------
      // log [ p(u | theta) ]
      //  ------------------------------------------------------------
      Float sig = theta[2];
      vec[0] -= Float(T_) * ( two * log(sig) + log( pi2 ) );
      for(size_t t = 0; t < T_; t++)
      {  Float w = u[t] / ( sig + eps );
         vec[0] -= w * w;
      }
      vec[0] /= two;
      //  ------------------------------------------------------------
      // log [ p(y | theta, u) ]
      //  ------------------------------------------------------------
      //
      // y_{i,t} * log( qt )
      vector<Float> yit_log_q(R_ * T_);
      // log( 1.0 - qt )
      vector<Float> log_1q(T_);
      for(size_t t = 0; t < T_; t++)
      {  Float ex  = exp( - u[t] - theta[1] );
         Float q   = one / (one + ex );
         log_1q[t] = log(one - q + eps);
         for(size_t i = 0; i < R_; i++)
            yit_log_q[ i * T_ + t ]  = Float(y_[i*T_+t]) * log(q + eps);
      }
      //
      // log( theta[0]^k * exp( - theta[0] ) / k! )
      vector<Float> log_poisson(K_);
      for(size_t k = 0; k < K_; k++)
      {  log_poisson[k] = log( theta[0] + eps ) * Float(k);
         log_poisson[k] -= theta[0];
         log_poisson[k] -= logfac_[k];
      }
      //
      // log[ p(y|theta, u) ] to vec[0]
      for(size_t i = 0; i < R_; i++)
      {  // compute maximum of log_pik for this i
         Float max_log_pik = log_pik[ i * K_ + M_[i] ];
         for(size_t k = M_[i] + 1; k < K_; k++)
         {  if( log_pik[ i * K_ + k ] > max_log_pik )
               max_log_pik = log_pik[ i * K_ + k ];
         }
         //
         // initialize p(y_i|theta,u)
         Float p_i = Float(0.0);
         for(size_t k = M_[i]; k < K_; k++)
         {  // compute p(y_i|N_i=k,theta,u) p(N_i=k|theta)
            //
            // initialize sum that needs to be calculated with Float
            // p(N_i=k|theta)
            Float float_sum = log_poisson[k];
            //
            // initialize sum that does not need to use Float
            double double_sum = 0.0;
            //
            // now compute terms that depend on t
            for(size_t t = 0; t < T_; t++)
            {  size_t yit = y_[ i * T_ + t ];
               //
               // k choose yit = k! / [ yit! * (k - yit)! ]
               // log [ (k choose yit) ]
               double_sum += logfac_[k] - logfac_[yit] - logfac_[k - yit];
               //
               // log ( qt^yit )
               float_sum += yit_log_q[i * T_ + t];
               //
               // log [ (1 - qt)^(k - yit) ]
               float_sum += Float(k - yit) * log_1q[t];
            }
            // log[ p(y_i|N_i=k,theta,u) p(N_i=k|theta)
            // (output elements of log_pik are not affected by its input)
            log_pik[ i * K_ + k ] = float_sum + double_sum;
            //
            // normalization avoids exponential overflow
            // p_i += p(y_i|N_i=k,theta,u) p(N_i=k|theta) / exp(max_log_pik)
            p_i += exp( log_pik[ i * K_ + k ] - max_log_pik );
         }
         // vec[0] += log[ p(y_i|theta,u) ]
         vec[0] += log( p_i );
         // following term does not depend on the fixed or random effects
         // vec[0] += Float(K_ - M_[i]) * max_log_pik;
      }
      // - log [ p(y|theta,u) p(u|theta) ]
      vec[0] = - vec[0];
      //
      // result may be infinite or nan when only computing log_pik
      // (initial log_pik is used to compute normalization factors)
      return vec;
   }
// ------------------------------------------------------------------------
public:
   // a1_vector ran_likelihood
   virtual a1_vector ran_likelihood(
      const a1_vector& fixed_vec  ,
      const a1_vector& random_vec )
   {  a1_vector log_pik(R_ * K_), vec(1);
      for(size_t ell = 0; ell < R_ * K_; ell++)
         log_pik[ell] = a1_double(log_pik_[ell]);
      vec = template_ran_likelihood(fixed_vec, random_vec, log_pik);
      // make sure result is finite
# ifndef NDEBUG
      double inf = std::numeric_limits<double>::infinity();
# endif
      assert( vec[0] < + a1_double(inf) );
      assert( vec[0] > - a1_double(inf) );
      //
      return vec;
   }
};
template <class Value>
void label_print(const char* label, const Value& value)
{  std::cout << std::setw(35) << std::left << label;
   std::cout << " = " << value << std::endl;
}
void label_print(const char* label, const double& value)
{  std::cout << std::setw(35) << std::left << label;
   int n_digits = 3 + int( std::log10(value) + 1e-9 );
   std::cout << " = " << std::setprecision(n_digits) << value << std::endl;
}
void bool_print(const char* label, bool value)
{  std::cout << std::setw(35) << std::left << label;
   if( value )
      std::cout << " = yes";
   else
      std::cout << " = no";
   std::cout << std::endl;
}

} // END_EMPTY_NAMESPACE

int main(int argc, const char *argv[])
{  bool ok = true;
   using std::endl;
   using std::string;
   //
   const char* arg_name[] = {
      "random_seed",
      "number_random",
      "quasi_fixed",
      "trace_optimize_fixed",
      "ipopt_solve",
      "bool_sparsity",
      "hold_memory",
      "derivative_test",
      "start_near_solution",
      "number_fixed_samples",
      "number_locations",
      "max_population",
      "mean_population",
      "mean_logit_probability",
      "std_logit_probability",
      "random_constraint"
   };
   size_t n_arg = sizeof(arg_name)/sizeof(arg_name[0]);
   //
   if( size_t(argc) != 1 + n_arg )
   {  // print usage error message
      std::cerr << "expected " << n_arg << " arguments and found "
      << argc - 1 << "\nusage: " << argv[0];
      for(size_t i = 0; i < n_arg; i++)
         std::cerr << " \\ \n\t" << arg_name[i];
      std::cerr << "\n";
      std::exit(1);
   }
   //
   // get command line arguments
   assert( n_arg == 16 );
   size_t iarg = 1;
   size_t random_seed              = std::atoi( argv[iarg++] );
   size_t number_random            = std::atoi( argv[iarg++] );
   string quasi_fixed_str          = argv[iarg++];
   string trace_optimize_fixed_str = argv[iarg++];
   string ipopt_solve_str          = argv[iarg++];
   string bool_sparsity_str        = argv[iarg++];
   string hold_memory_str          = argv[iarg++];
   string derivative_test_str      = argv[iarg++];
   string start_near_solution_str  = argv[iarg++];
   //
   size_t number_fixed_samples   = std::atoi( argv[iarg++] );
   size_t number_locations       = std::atoi( argv[iarg++] );
   size_t max_population         = std::atoi( argv[iarg++] );
   double mean_population        = std::atof( argv[iarg++] );
   double mean_logit_probability = std::atof( argv[iarg++] );
   double std_logit_probability  = std::atof( argv[iarg++] );
   string random_constraint_str  = argv[iarg++];
   //
   bool random_constraint    =  random_constraint_str == "yes";
   bool quasi_fixed          =  quasi_fixed_str == "yes";
   bool trace_optimize_fixed =  trace_optimize_fixed_str == "yes";
   bool ipopt_solve          =  ipopt_solve_str == "yes";
   bool bool_sparsity        =  bool_sparsity_str == "yes";
   bool hold_memory          =  hold_memory_str == "yes";
   bool derivative_test      =  derivative_test_str == "yes";
   bool start_near_solution  =  start_near_solution_str == "yes";
   //
   // hold memory setting
   CppAD::thread_alloc::hold_memory(hold_memory);
   //
   // print the command line arguments with labels for each value
   for(size_t i = 0; i < n_arg; i++)
      label_print(arg_name[i], argv[1+i]);
   //
   assert( number_fixed_samples > 0 );
   assert( number_locations > 0 );
   assert( number_random > 0 );
   assert( max_population > 0 );
   assert( mean_population > 0.0 );
   assert( std_logit_probability > 0.0 );
   assert( random_constraint || random_constraint_str=="no" );
   assert( quasi_fixed || quasi_fixed_str=="no" );
   assert( trace_optimize_fixed || trace_optimize_fixed_str=="no" );
   assert( ipopt_solve || ipopt_solve_str=="no" );
   assert( bool_sparsity || bool_sparsity_str =="no" );
   assert( hold_memory || hold_memory_str =="no" );
   assert( derivative_test || derivative_test_str =="no" );
   assert( start_near_solution || start_near_solution_str =="no" );
   //
   // configuration options
   label_print("cppad_mixed_version",    CPPAD_MIXED_VERSION);
   bool_print("ldlt_cholmod",            CPPAD_MIXED_LDLT_CHOLMOD);
   bool_print("optimize_cppad_function", CPPAD_MIXED_OPTIMIZE_CPPAD_FUNCTION);
# ifdef NDEBUG
   bool_print("ndebug_defined",          true);
# else
   bool_print("ndebug_defined",          false);
# endif
   //
   // Get actual seed (different when random_seed is zero).
   size_t actual_seed     = CppAD::mixed::new_gsl_rng( random_seed );
   label_print("actual_seed", actual_seed);
   //
   // number of fixed and random effects
   size_t n_fixed  = 3;
   size_t n_random = number_random;
   //
   // number of random effects
   size_t T = number_random;
   size_t R = number_locations;
   size_t K = max_population;
   //
   d_vector theta_sim(n_fixed);
   theta_sim[0] = mean_population;
   theta_sim[1] = mean_logit_probability;
   theta_sim[2] = std_logit_probability;
   //
   // simulate y
   s_vector y(R * T);
   simulate(random_constraint, R, T, theta_sim, y);

   // lower and upper limits
   d_vector fix_constraint_lower, fix_constraint_upper;
   d_vector theta_lower(n_fixed), theta_in(n_fixed), theta_upper(n_fixed);
   //
   // theta[0] is estimate of mean_population
   theta_lower[0] = 0.0;
   theta_in[0]    = mean_population / 2.0;
   theta_upper[0] = 2.0 * mean_population;
   //
   // theta[1] is estimate of mean_logit_probability
   theta_lower[1] = mean_logit_probability - 2.0;
   theta_in[1]    = mean_logit_probability - 0.5;
   theta_upper[1] = mean_logit_probability + 2.0;
   //
   // theta[2] is estimate of std_logit_probability
   theta_lower[2] = std_logit_probability / 10.;
   theta_in[2]    = std_logit_probability / 2.0;
   theta_upper[2] = std_logit_probability * 10.;

   // check if we are starting at the simulation values
   if( start_near_solution )
   {  for(size_t j = 0; j < n_fixed; j++)
         theta_in[j] = theta_sim[j];
   }

   // random constraints
   CppAD::mixed::sparse_rc A_pattern;
   if( random_constraint )
   {  A_pattern.resize(1, T, T);
      for(size_t t = 0; t < T; t++)
         A_pattern.set(t, 0, t);
   }
   d_sparse_rcv A_rcv( A_pattern );
   for(size_t t = 0; t < A_rcv.nc(); t++)
      A_rcv.set(t, 1.0);

   // initialize random effects to start optimization at
   d_vector u_in(T);
   for(size_t t = 0; t < T; t++)
      u_in[t] = 0.0;
   //
   // create derived object
   mixed_derived mixed_object(
      R, T, K, quasi_fixed, bool_sparsity, A_rcv, y, theta_in, u_in
   );
   //
   // start timing of initialize
   size_t thread         = CppAD::thread_alloc::thread_num();
   size_t start_bytes    = CppAD::thread_alloc::inuse(thread);
   double start_seconds  = CppAD::elapsed_seconds();
   //
   mixed_object.initialize(theta_in, u_in);
   //
   double end_seconds = CppAD::elapsed_seconds();
   size_t end_bytes   = CppAD::thread_alloc::inuse(thread);
   //
   // print amoumt of memory added to mixed_object during initialize
   // (use commas to separate every three digits).
   string initialize_bytes = size_t2string(end_bytes - start_bytes);
   label_print("initialize_bytes", initialize_bytes);
   label_print("initialize_seconds", end_seconds - start_seconds);
   //
   // ipopt options for optimizing the random effects
   string random_ipopt_options =
      "Integer print_level               0\n"
      "String  sb                        yes\n"
      "String  derivative_test           none\n"
      "Numeric tol                       1e-9\n"
   ;
   if( ipopt_solve ) random_ipopt_options +=
      "String evaluation_method         ipopt_solve\n";
   //
   // ipopt options for optimizing the fixd effects
   string fixed_ipopt_options =
      "String  sb                        yes\n"
      "Numeric tol                       1e-9\n"
      "Integer max_iter                  40\n"
   ;
   if( trace_optimize_fixed )
      fixed_ipopt_options += "Integer print_level         5\n";
   else
      fixed_ipopt_options += "Integer print_level         0\n";
   //
   if( derivative_test )
      fixed_ipopt_options += "String derivative_test      first-order\n";
   else
      fixed_ipopt_options += "String derivative_test      none\n";
   //
   double inf = std::numeric_limits<double>::infinity();
   d_vector u_lower(n_random), u_upper(n_random);
   for(size_t i = 0; i < n_random; i++)
   {  u_lower[i] = -inf;
      u_upper[i] = +inf;
   }
   //
   // optimize fixed effects
   start_seconds = CppAD::elapsed_seconds();
   if( trace_optimize_fixed )
      std::cout << endl;
   d_vector theta_scale = theta_in;
   CppAD::mixed::fixed_solution solution = mixed_object.optimize_fixed(
      fixed_ipopt_options,
      random_ipopt_options,
      theta_lower,
      theta_upper,
      fix_constraint_lower,
      fix_constraint_upper,
      theta_scale,
      theta_in,
      u_lower,
      u_upper,
      u_in
   );
   end_seconds = CppAD::elapsed_seconds();
   if( trace_optimize_fixed )
      std::cout << endl;
   label_print("optimize_fixed_seconds", end_seconds - start_seconds);
   //
   // estimate of fixed effects
   d_vector theta_out = solution.fixed_opt;
   //
   // estimate of random effects
   start_seconds = CppAD::elapsed_seconds();
   d_vector u_out     = mixed_object.optimize_random(
      random_ipopt_options,
      theta_out,
      u_lower,
      u_upper,
      u_in
   );
   end_seconds = CppAD::elapsed_seconds();
   label_print("optimize_random_seconds", end_seconds - start_seconds);
   //
   // information matrix
   start_seconds = CppAD::elapsed_seconds();
   d_sparse_rcv hes_fixed_obj_rcv = mixed_object.hes_fixed_obj(
      solution.fixed_opt, u_out
   );
   end_seconds = CppAD::elapsed_seconds();
   label_print("information_mat_seconds", end_seconds - start_seconds);
   //
   // sample approximate posteroior for fixed effects
   start_seconds = CppAD::elapsed_seconds();
   d_vector sample( number_fixed_samples * n_fixed );
   mixed_object.sample_fixed(
      sample,
      hes_fixed_obj_rcv,
      solution,
      theta_lower,
      theta_upper
   );
   end_seconds = CppAD::elapsed_seconds();
   label_print("sample_fixed_seconds", end_seconds - start_seconds);
   //
   end_bytes          = CppAD::thread_alloc::inuse(thread);
   string final_bytes = size_t2string(end_bytes - start_bytes);
   label_print("final_bytes", final_bytes);
   //
   // sum of random effects
   double sum_random_effects = 0.0;
   for(size_t j = 0; j < n_random; j++)
      sum_random_effects += u_out[j];
   if( random_constraint )
      ok &= std::fabs( sum_random_effects ) < 1e-8;
   label_print("sum_random_effects", sum_random_effects);
   //
   // sample_std
   d_vector sample_std(n_fixed);
   for(size_t j = 0; j < n_fixed; j++)
      sample_std[j] = 0.0;
   for(size_t i = 0; i < number_fixed_samples; i++)
   {  for(size_t j = 0; j < n_fixed; j++)
      {  double diff    = sample[i * n_fixed + j] - theta_out[j];
         sample_std[j] += diff * diff;
      }
   }
   // estimate_ratio
   d_vector estimate_ratio(n_fixed);
   for(size_t j = 0; j < n_fixed; j++)
   {  sample_std[j] = std::sqrt(sample_std[j]/double(number_fixed_samples));
      // check if results results are reasonable
      estimate_ratio[j] = ( theta_out[j] - theta_sim[j] ) / sample_std[j];
      ok  &= std::fabs(estimate_ratio[j]) < 10.0;
   }
   // estimates
   label_print("mean_population_estimate", theta_out[0]);
   label_print("mean_logit_probability_estimate", theta_out[1]);
   label_print("std_logit_probability_estimate", theta_out[2]);
   // simulation sample standard deviations
   label_print("mean_population_std", sample_std[0]);
   label_print("mean_logit_probability_std", sample_std[1]);
   label_print("std_logit_probability_std", sample_std[2]);
   // ratios, error condition is ratio >= 5.0
   label_print("mean_population_ratio", estimate_ratio[0]);
   label_print("mean_logit_probability_ratio", estimate_ratio[1]);
   label_print("std_logit_probability_ratio", estimate_ratio[2]);
   //
   if( ok )
      label_print("capture_xam_ok", "yes");
   else
      label_print("capture_xam_ok", "no");
   //
   // make sure CppAD::thread_alloc is no longer holding memory
   CppAD::thread_alloc::hold_memory(false);
   // free memory allocated by new_gsl_rng
   CppAD::mixed::free_gsl_rng();
   if( ok )
      return 0;
   return 1;
}