Couple documentation fixes #1847

cgarling · 2024-04-02T10:50:07Z

Have some time so fixing some of the documentation issues like #1845. Will try to document things I notice but don't feel qualified to fix.

devmotion · 2024-04-02T11:12:02Z

docs/src/multivariate.md

+pdf(::Distribution{ArrayLikeVariate{N}}, x::AbstractArray{<:Real,M}) where {N,M}
+logpdf(::Distribution{ArrayLikeVariate{N}}, x::AbstractArray{<:Real,M}) where {N,M}


I'm not sure... This page is about MultivariateDistribution but the methods listed here are for generic distributions with array-like variates. I think it would be better to include these generic docstrings in a separate page and only list here pdf, logpdf etc. as part of the interface for multivariate distributions with a link to the generic docstring.

That's a valid concern, I was just following the docstring for loglikelihood for array-like variates which is currently listed on the multivariate page, so I thought it would be consistent.

Distributions.jl/src/common.jl

Lines 433 to 443 in f33af97

"""

loglikelihood(d::Distribution{ArrayLikeVariate{N}}, x) where {N}

The log-likelihood of distribution `d` with respect to all variate(s) contained in `x`.

Here, `x` can be any output of `rand(d, dims...)` and `rand!(d, x)`. For instance, `x` can

be

- an array of dimension `N` with `size(x) == size(d)`,

- an array of dimension `N + 1` with `size(x)[1:N] == size(d)`, or

- an array of arrays `xi` of dimension `N` with `size(xi) == size(d)`.

"""

If you think these should be taken to a separate page, then loglikelihood probably should as well

devmotion · 2024-04-02T11:12:54Z

docs/src/univariate.md

@@ -73,7 +73,7 @@ pdfsquaredL2norm
 insupport(::UnivariateDistribution, x::Any)
 pdf(::UnivariateDistribution, ::Real)
 logpdf(::UnivariateDistribution, ::Real)
-loglikelihood(::UnivariateDistribution, ::AbstractArray)
+loglikelihood(::UnivariateDistribution, ::Real)


You almost never want to call loglikelihood with ::Real but rather with ::AbstractArray, so documenting the former instead of the latter seems wrong.

Where is that method?

Thanks for the review. It is not obvious to me why the ::AbstractArray signature would be preferred over broadcasting through the ::Real signature for UnivariateDistribution. My naive go-to would be the ::Real method as it is available for all UnivariateDistribution and the ::AbstractArray signature is not. You could make the docstring more useful by adding a sentence explaining why you may or may not want to use it.

The canonical method for single variates is just logpdf. The real benefit of loglikelihood is that it sums the logpdf values for multiple variates, possibly using exploiting constant terms etc.

loglikelihood is defined here:

Distributions.jl/src/common.jl

Lines 444 to 465 in f33af97

Base.@propagate_inbounds @inline function loglikelihood(

d::Distribution{ArrayLikeVariate{N}}, x::AbstractArray{<:Real,M},

) where {N,M}

if M == N

return logpdf(d, x)

else

@boundscheck begin

M > N ||

throw(DimensionMismatch(

"number of dimensions of the variates ($M) must be greater than or equal to the dimension of the distribution ($N)"

))

ntuple(i -> size(x, i), Val(N)) == size(d) ||

throw(DimensionMismatch("inconsistent array dimensions"))

end

return @inbounds sum(Base.Fix1(logpdf, d), eachvariate(x, ArrayLikeVariate{N}))

end

end

Base.@propagate_inbounds function loglikelihood(

d::Distribution{ArrayLikeVariate{N}}, x::AbstractArray{<:AbstractArray{<:Real,N}},

) where {N}

return sum(Base.Fix1(logpdf, d), x)

end

Ah, I was confused about "ArrayLikeVariate{N}", does this include univariate distributions?

Oh, I see, it calls down to

Distributions.jl/src/common.jl

Lines 434 to 465 in f33af97

loglikelihood(d::Distribution{ArrayLikeVariate{N}}, x) where {N}

The log-likelihood of distribution `d` with respect to all variate(s) contained in `x`.

Here, `x` can be any output of `rand(d, dims...)` and `rand!(d, x)`. For instance, `x` can

be

- an array of dimension `N` with `size(x) == size(d)`,

- an array of dimension `N + 1` with `size(x)[1:N] == size(d)`, or

- an array of arrays `xi` of dimension `N` with `size(xi) == size(d)`.

"""

Base.@propagate_inbounds @inline function loglikelihood(

d::Distribution{ArrayLikeVariate{N}}, x::AbstractArray{<:Real,M},

) where {N,M}

if M == N

return logpdf(d, x)

else

@boundscheck begin

M > N ||

throw(DimensionMismatch(

"number of dimensions of the variates ($M) must be greater than or equal to the dimension of the distribution ($N)"

))

ntuple(i -> size(x, i), Val(N)) == size(d) ||

throw(DimensionMismatch("inconsistent array dimensions"))

end

return @inbounds sum(Base.Fix1(logpdf, d), eachvariate(x, ArrayLikeVariate{N}))

end

end

Base.@propagate_inbounds function loglikelihood(

d::Distribution{ArrayLikeVariate{N}}, x::AbstractArray{<:AbstractArray{<:Real,N}},

) where {N}

return sum(Base.Fix1(logpdf, d), x)

end

because UnivariateDistribution <: Distribution{ArrayLikeVariate{0}}
For some reason I thought the univariate case was defined differently...

It seems like the real issue might be how we go about linking these general methods into the separate markdown pages, as the current @docs strings no longer work with the new methods. I don't really feel comfortable doing that myself, I don't understand the internals all that well (as shown above...)

devmotion · 2024-04-02T11:13:35Z

src/univariates.jl

+"""
+    loglikelihood(d::UnivariateDistribution, x::Real)
+
+Evaluate the logarithm of the likelihood at `x`.
+
+See also: [`logpdf`](@ref).
+"""


Same here, this is just a fallback for the case of a single variate but not the typical use case of loglikelihood.

cgarling · 2024-04-02T11:29:25Z

Some other minor docs issues I noted that I could work on with some guidance:

Some local links like [Beta distribution](@ref Beta) in the docstring for Kumaraswamy aren't working for some reason I can't figure out

Distributions.jl/src/univariate/continuous/kumaraswamy.jl

Lines 11 to 14 in f33af97

    
           It is related to the [Beta distribution](@ref Beta) by the following identity: 
        
           if ``X \\sim \\operatorname{Kumaraswamy}(a, b)`` then ``X^a \\sim \\operatorname{Beta}(1, b)``. 
        
           In particular, if ``X \\sim \\operatorname{Kumaraswamy}(1, 1)`` then 
        
           ``X \\sim \\operatorname{Uniform}(0, 1)``.

There are @docs entries for _logpdf in multivariate.md and matrix.md but no docstrings in the source. There is a commented out method in common.jl, but I'm not sure where a docstring should be added or the references should be removed

Distributions.jl/src/common.jl

Lines 276 to 279 in f33af97

    
           # `_logpdf` should be implemented and has no default definition 
        
           # _logpdf(d::Distribution{ArrayLikeVariate{N}}, x::AbstractArray{<:Real,N}) where {N}

@docs entry for _rand! but no docstring; not sure what this does so can't write one

Distributions.jl/docs/src/matrix.md

Line 28 in f33af97

Distributions._rand!(::AbstractRNG, ::MatrixDistribution, A::AbstractMatrix)
Warnings for references [`Distribution`](@ref), [`ReshapedDistribution`](@ref), [`MultivariateDistribution`](@ref) in reshape.md. MultivariateDistribution appears not to have a docstring. The other two do, but they don't seem to have @docs entries and so the linking fails. I feel like MultivariateDistribution get a docstring and have an @docs entry at the top of multivariate.md and ReshapedDistribution should likewise have an @docs entry at the top of reshape.md. I would guess that Distribution should get an @docs entry on the "Type Hierarchy" page under the "Distributions" heading.

cgarling added 2 commits April 2, 2024 06:24

Fix univariate loglikelihood doc

92085f1

Fix multivariate pdf and logpdf docs

1b5a62e

devmotion reviewed Apr 2, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Couple documentation fixes #1847

Couple documentation fixes #1847

cgarling commented Apr 2, 2024

devmotion Apr 2, 2024

cgarling Apr 2, 2024

devmotion Apr 2, 2024

mschauer Apr 2, 2024

cgarling Apr 2, 2024

devmotion Apr 2, 2024

devmotion Apr 2, 2024

mschauer Apr 2, 2024

cgarling Apr 2, 2024

devmotion Apr 2, 2024

cgarling commented Apr 2, 2024

		pdf(::Distribution{ArrayLikeVariate{N}}, x::AbstractArray{<:Real,M}) where {N,M}
		logpdf(::Distribution{ArrayLikeVariate{N}}, x::AbstractArray{<:Real,M}) where {N,M}

	"""
	loglikelihood(d::Distribution{ArrayLikeVariate{N}}, x) where {N}

	The log-likelihood of distribution `d` with respect to all variate(s) contained in `x`.

	Here, `x` can be any output of `rand(d, dims...)` and `rand!(d, x)`. For instance, `x` can
	be
	- an array of dimension `N` with `size(x) == size(d)`,
	- an array of dimension `N + 1` with `size(x)[1:N] == size(d)`, or
	- an array of arrays `xi` of dimension `N` with `size(xi) == size(d)`.
	"""

	Base.@propagate_inbounds @inline function loglikelihood(
	d::Distribution{ArrayLikeVariate{N}}, x::AbstractArray{<:Real,M},
	) where {N,M}
	if M == N
	return logpdf(d, x)
	else
	@boundscheck begin
	M > N \|\|
	throw(DimensionMismatch(
	"number of dimensions of the variates ($M) must be greater than or equal to the dimension of the distribution ($N)"
	))
	ntuple(i -> size(x, i), Val(N)) == size(d) \|\|
	throw(DimensionMismatch("inconsistent array dimensions"))
	end
	return @inbounds sum(Base.Fix1(logpdf, d), eachvariate(x, ArrayLikeVariate{N}))
	end
	end
	Base.@propagate_inbounds function loglikelihood(
	d::Distribution{ArrayLikeVariate{N}}, x::AbstractArray{<:AbstractArray{<:Real,N}},
	) where {N}
	return sum(Base.Fix1(logpdf, d), x)
	end

Couple documentation fixes #1847

Are you sure you want to change the base?

Couple documentation fixes #1847

Conversation

cgarling commented Apr 2, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cgarling commented Apr 2, 2024