State of categorical variables

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
5 messages Options
Reply | Threaded
Open this post in threaded view
|

State of categorical variables

Benjamin Deonovic
How is the state of categorical variables (or "factors" as they are called in R) going? I know The current implementation is PooledDataArrays in DataFrames. I was wondering what holes still exist, etc. Will categorical variables/factors be possible outside of a data frame? How about the famous R table function? I believe that is still missing.

--
You received this message because you are subscribed to the Google Groups "julia-stats" group.
To unsubscribe from this group and stop receiving emails from it, send an email to [hidden email].
For more options, visit https://groups.google.com/d/optout.
Reply | Threaded
Open this post in threaded view
|

Re: State of categorical variables

Milan Bouchet-Valat
Le jeudi 12 novembre 2015 à 14:11 -0800, Benjamin Deonovic a écrit :
> How is the state of categorical variables (or "factors" as they are
> called in R) going? I know The current implementation is
> PooledDataArrays in DataFrames. I was wondering what holes still
> exist, etc. Will categorical variables/factors be possible outside of
> a data frame? How about the famous R table function? I believe that
> is still missing.
PooledDataArrays are perfectly usable outside of DataFrames.

John experimented with a replacement, but I'm not sure what's the
status of his work at the moment:
https://github.com/johnmyleswhite/CategoricalData.jl


Regarding an equivalent of table(), see
http://statsbasejl.readthedocs.org/en/latest/counts.html

as well as this small package of mine:
https://github.com/nalimilan/FreqTables.jl

I doesn't work currently with 0.4, but that should be easy to fix (I'll
do it soon).


Regards

> --
> You received this message because you are subscribed to the Google
> Groups "julia-stats" group.
> To unsubscribe from this group and stop receiving emails from it,
> send an email to [hidden email].
> For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "julia-stats" group.
To unsubscribe from this group and stop receiving emails from it, send an email to [hidden email].
For more options, visit https://groups.google.com/d/optout.
Reply | Threaded
Open this post in threaded view
|

Re: State of categorical variables

Benjamin Deonovic
Counts doesn't give the same output as table because you have to provide counts with the levels of the variables and you can only provide "ranges" as levels. Thanks for the other suggestions.

On Thursday, November 12, 2015 at 4:43:03 PM UTC-6, Milan Bouchet-Valat wrote:
Le jeudi 12 novembre 2015 à 14:11 -0800, Benjamin Deonovic a écrit :
> How is the state of categorical variables (or "factors" as they are
> called in R) going? I know The current implementation is
> PooledDataArrays in DataFrames. I was wondering what holes still
> exist, etc. Will categorical variables/factors be possible outside of
> a data frame? How about the famous R table function? I believe that
> is still missing.
PooledDataArrays are perfectly usable outside of DataFrames.

John experimented with a replacement, but I'm not sure what's the
status of his work at the moment:
<a href="https://github.com/johnmyleswhite/CategoricalData.jl" target="_blank" rel="nofollow" onmousedown="this.href=&#39;https://www.google.com/url?q\75https%3A%2F%2Fgithub.com%2Fjohnmyleswhite%2FCategoricalData.jl\46sa\75D\46sntz\0751\46usg\75AFQjCNF6n5XBR1J51IJ6gGJPsPMw-zXJag&#39;;return true;" onclick="this.href=&#39;https://www.google.com/url?q\75https%3A%2F%2Fgithub.com%2Fjohnmyleswhite%2FCategoricalData.jl\46sa\75D\46sntz\0751\46usg\75AFQjCNF6n5XBR1J51IJ6gGJPsPMw-zXJag&#39;;return true;">https://github.com/johnmyleswhite/CategoricalData.jl


Regarding an equivalent of table(), see
<a href="http://statsbasejl.readthedocs.org/en/latest/counts.html" target="_blank" rel="nofollow" onmousedown="this.href=&#39;http://www.google.com/url?q\75http%3A%2F%2Fstatsbasejl.readthedocs.org%2Fen%2Flatest%2Fcounts.html\46sa\75D\46sntz\0751\46usg\75AFQjCNFpQL3rB4EJCrGGu8KSDAGmiZN7iw&#39;;return true;" onclick="this.href=&#39;http://www.google.com/url?q\75http%3A%2F%2Fstatsbasejl.readthedocs.org%2Fen%2Flatest%2Fcounts.html\46sa\75D\46sntz\0751\46usg\75AFQjCNFpQL3rB4EJCrGGu8KSDAGmiZN7iw&#39;;return true;">http://statsbasejl.readthedocs.org/en/latest/counts.html

as well as this small package of mine:
<a href="https://github.com/nalimilan/FreqTables.jl" target="_blank" rel="nofollow" onmousedown="this.href=&#39;https://www.google.com/url?q\75https%3A%2F%2Fgithub.com%2Fnalimilan%2FFreqTables.jl\46sa\75D\46sntz\0751\46usg\75AFQjCNEuAPaptqcIcTn-VAjVPFUH4FLBOg&#39;;return true;" onclick="this.href=&#39;https://www.google.com/url?q\75https%3A%2F%2Fgithub.com%2Fnalimilan%2FFreqTables.jl\46sa\75D\46sntz\0751\46usg\75AFQjCNEuAPaptqcIcTn-VAjVPFUH4FLBOg&#39;;return true;">https://github.com/nalimilan/FreqTables.jl

I doesn't work currently with 0.4, but that should be easy to fix (I'll
do it soon).


Regards

> --
> You received this message because you are subscribed to the Google
> Groups "julia-stats" group.
> To unsubscribe from this group and stop receiving emails from it,
> send an email to <a href="javascript:" target="_blank" gdf-obfuscated-mailto="mgUDmlMYBgAJ" rel="nofollow" onmousedown="this.href=&#39;javascript:&#39;;return true;" onclick="this.href=&#39;javascript:&#39;;return true;">julia-stats...@googlegroups.com.
> For more options, visit <a href="https://groups.google.com/d/optout" target="_blank" rel="nofollow" onmousedown="this.href=&#39;https://groups.google.com/d/optout&#39;;return true;" onclick="this.href=&#39;https://groups.google.com/d/optout&#39;;return true;">https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "julia-stats" group.
To unsubscribe from this group and stop receiving emails from it, send an email to [hidden email].
For more options, visit https://groups.google.com/d/optout.
Reply | Threaded
Open this post in threaded view
|

Re: State of categorical variables

Douglas Bates
See StatsBase::countmap for generating a table.

--
You received this message because you are subscribed to the Google Groups "julia-stats" group.
To unsubscribe from this group and stop receiving emails from it, send an email to [hidden email].
For more options, visit https://groups.google.com/d/optout.
Reply | Threaded
Open this post in threaded view
|

Re: State of categorical variables

Benjamin Deonovic
Countmap doesn't do cross-tabulation of two variables e.g. R's table(x,y).

On Sunday, November 15, 2015 at 12:09:19 PM UTC-6, Douglas Bates wrote:
See StatsBase::countmap for generating a table.

--
You received this message because you are subscribed to the Google Groups "julia-stats" group.
To unsubscribe from this group and stop receiving emails from it, send an email to [hidden email].
For more options, visit https://groups.google.com/d/optout.