Bot releases are hidden (Show)
arrow-14.0.2
(#15108) @galipremsagarpandas-2.2
in cudf
(#15100) @galipremsagarfuture_stack
to DataFrame.stack
(#15015) @galipremsagarpandas-2.x
support in cudf
(#14916) @galipremsagardtype='category'
(#15476) @galipremsagarFalse
(#15340) @rjzamorainflate_kernel
(#15309) @vuulehostdevice_vector
and add more APIs (#15252) @ttnghiacollect_list
/collect_set
of lists column (#15243) @ttnghia.columns
by an external API (#15212) @galipremsagar__dask_tokenize__
(#15187) @rjzamoraListColumn.to_pandas()
to retain list
type (#15155) @galipremsagarconst
from range_window_bounds::_extent
. (#15138) @mythrocksGroupBy.apply
when chunk results are reindexed series (#15109) @brandon-b-millerlarge_string
in cudf
(#15093) @galipremsagarsort_values
pytest failure with pandas-2.x regression (#15092) @galipremsagarget_json_object
(#15082) @SurajAralihalliis_device_write_preferred
in void_sink
and user_sink_wrapper
(#15064) @vuuleIndex.difference
to handle duplicate values when one of the inputs is empty (#15016) @galipremsagarfuture_stack
to DataFrame.stack
(#15015) @galipremsagarDataFrame.sort_index
to respect ignore_index
on all axis (#14995) @galipremsagarSeriesGroupBy.aggregate
to SeriesGroupBy.agg
(#14971) @rjzamoraCUDF_SPILL
after a pytest (#14958) @galipremsagardeveloper_guide.md
with new guidance on quoted internal includes (#15238) @harrismsegmented_row_bit_count
for computing row sizes by segments of rows (#15169) @ttnghiaconda env create --yes
instead of --force
(#15403) @bdicecudf.pandas
(#15316) @galipremsagarcudf.pandas
pandas tests (#15284) @galipremsagarmixed_types_as_string
option is enabled (#15236) @shrshipytest-xdist
algo to worksteal
(#15207) @galipremsagarget_upstream_resource
method to stream_checking_resource_adaptor
(#15203) @misccooffset
value for generating test data in parquet_chunked_reader_test.cu
(#15200) @ttnghia__cuda_array_interface__
tests (#15188) @bdicebyte_range
in read_json
when the size is not smaller than the input data (#15180) @vuulecudf.pandas
(#15147) @galipremsagarcudf
(#15145) @galipremsagarhostdevice_vector
in kernel_error
to avoid the pageable copy (#15140) @vuulearrow-14.0.2
(#15108) @galipremsagarpandas-2.2
in cudf
(#15100) @galipremsagardatetime
binop pytest failures in pandas-2.2 (#15090) @galipremsagarcudf::sort
(#15066) @wence-DeprecationWarning
in test_hdf (#15044) @mroeschkeDeprecationWarning
's by ArrowTable.to_pandas()
(#14989) @galipremsagarctest
is called with --no-tests=error
. (#14983) @bdiceperiods
in date_range
and interval_range
(#14976) @galipremsagarGetJsonObjectOptions
in getJSONObject
Java API (#14956) @SurajAralihallipandas-2.x
support in cudf
(#14916) @galipremsagarcuco_allocator
(#14877) @PointKernelcudf.__version__
in Sphinx build (#14872) @KyleFromNVIDIArapids_cuda_set_runtime
to determine cuda runtime usage by target (#14833) @vyasrbuild_struct|list_column
(#14786) @mroeschkePublished by raydouglass 8 months ago
shuffle_method
kwarg (#14708) @pentschevint64_t
type (#14907) @SurajAralihallishuffle
deprecation to align with dask/dask (#14762) @rjzamoraGroupby.get_group
(#14728) @rjzamorashuffle_method
kwarg (#14708) @pentschevBaseOffset
(#14678) @shwinacuda::proclaim_return_type
from nested lambda (#14607) @ttnghiacudf::clamp
APIs (#14580) @davidwendtpd.Timestamp
objects to be picklable when cudf.pandas
is active (#14474) @shwinaMultiIndex.from_pandas
(#14470) @mroeschkestring_scalar
constructors (#14444) @vuulenumba<0.58
(#14616) @brandon-b-millerpytest<8
(#14920) @galipremsagarTimedeltaIndex.__init__
constructor (#14775) @mroeschkeDatetimeIndex.__init__
constructor (#14774) @mroeschkeframe.py
typing, move seldom used methods in frame.py
(#14766) @mroeschkepynvjitlink
as a dependency (#14763) @brandon-b-millercreate_structs_data
(#14761) @SurajAralihallicudf.MultiIndex.from_arrays
(#14740) @mroeschkeio::detail::orc
namespace into io::orc::detail
(#14696) @ttnghiaFieldType
to scoped enum (#14642) @vuulefreq
in DatetimeIndex (#14593) @shwinaCompactProtocolReader
(#14582) @vuuleCUDF_TEST_EXPECT_COLUMN_*
macros (#14559) @SurajAralihalliio::statistics
(#14542) @vuuleIndexedFrame.duplicated
with distinct_indices
+ scatter
(#14493) @wence-volatile
in Parquet (#14448) @vuulePublished by raydouglass 8 months ago
shuffle_method
kwarg (#14708) @pentschevint64_t
type (#14907) @SurajAralihallishuffle
deprecation to align with dask/dask (#14762) @rjzamoraGroupby.get_group
(#14728) @rjzamorashuffle_method
kwarg (#14708) @pentschevBaseOffset
(#14678) @shwinacuda::proclaim_return_type
from nested lambda (#14607) @ttnghiacudf::clamp
APIs (#14580) @davidwendtpd.Timestamp
objects to be picklable when cudf.pandas
is active (#14474) @shwinaMultiIndex.from_pandas
(#14470) @mroeschkestring_scalar
constructors (#14444) @vuulenumba<0.58
(#14616) @brandon-b-millerpytest<8
(#14920) @galipremsagarTimedeltaIndex.__init__
constructor (#14775) @mroeschkeDatetimeIndex.__init__
constructor (#14774) @mroeschkeframe.py
typing, move seldom used methods in frame.py
(#14766) @mroeschkepynvjitlink
as a dependency (#14763) @brandon-b-millercreate_structs_data
(#14761) @SurajAralihallicudf.MultiIndex.from_arrays
(#14740) @mroeschkeio::detail::orc
namespace into io::orc::detail
(#14696) @ttnghiaFieldType
to scoped enum (#14642) @vuulefreq
in DatetimeIndex (#14593) @shwinaCompactProtocolReader
(#14582) @vuuleCUDF_TEST_EXPECT_COLUMN_*
macros (#14559) @SurajAralihalliio::statistics
(#14542) @vuuleIndexedFrame.duplicated
with distinct_indices
+ scatter
(#14493) @wence-volatile
in Parquet (#14448) @vuulePublished by raydouglass 8 months ago
shuffle_method
kwarg (#14708) @pentschevint64_t
type (#14907) @SurajAralihallishuffle
deprecation to align with dask/dask (#14762) @rjzamoraGroupby.get_group
(#14728) @rjzamorashuffle_method
kwarg (#14708) @pentschevBaseOffset
(#14678) @shwinacuda::proclaim_return_type
from nested lambda (#14607) @ttnghiacudf::clamp
APIs (#14580) @davidwendtpd.Timestamp
objects to be picklable when cudf.pandas
is active (#14474) @shwinaMultiIndex.from_pandas
(#14470) @mroeschkestring_scalar
constructors (#14444) @vuulenumba<0.58
(#14616) @brandon-b-millerpytest<8
(#14920) @galipremsagarTimedeltaIndex.__init__
constructor (#14775) @mroeschkeDatetimeIndex.__init__
constructor (#14774) @mroeschkeframe.py
typing, move seldom used methods in frame.py
(#14766) @mroeschkepynvjitlink
as a dependency (#14763) @brandon-b-millercreate_structs_data
(#14761) @SurajAralihallicudf.MultiIndex.from_arrays
(#14740) @mroeschkeio::detail::orc
namespace into io::orc::detail
(#14696) @ttnghiaFieldType
to scoped enum (#14642) @vuulefreq
in DatetimeIndex (#14593) @shwinaCompactProtocolReader
(#14582) @vuuleCUDF_TEST_EXPECT_COLUMN_*
macros (#14559) @SurajAralihalliio::statistics
(#14542) @vuuleIndexedFrame.duplicated
with distinct_indices
+ scatter
(#14493) @wence-volatile
in Parquet (#14448) @vuulePublished by raydouglass 11 months ago
reindex
when index
is not unique (#14400) @galipremsagarcudf.pandas
profiler (#14514) @galipremsagarbranch-23.10
into branch-23.12
(#14435) @raydouglassreindex
when index
is not unique (#14400) @galipremsagar__getattr__
(#14388) @shwinacudf::merge
(#14345) @divyegaladask_cudf/io
pytests in CI (#14338) @galipremsagarrecover_with_nulls
is enabled (#14252) @elstehledeallocate
(#14251) @robertmaynardpynvjitlink
for CUDA 12+ MVC (#13650) @brandon-b-millertest_kurt_skew_error
(#14368) @galipremsagarnullable()
to has_nulls()
in cudf::detail::gather
(#14363) @divyegalaarrow
to 13
(#14330) @galipremsagarpyorc
dependency and use pandas
/pyarrow
instead (#14323) @galipremsagarpyarrow.fs
import for local storage (#14321) @rjzamoradask
and distributed
for 23.12
development (#14320) @galipremsagarsource_info
(#14308) @vuuleshared-action-workflows
references (#14289) @AyodeAwepartd
encode dispatch in dask_cudf
(#14287) @rjzamoracudf::merge
for nested types without using comparators (#14250) @divyegalacontains_column
by invoking contains_table
(#14238) @PointKernel23.10
into 23.12
(#14231) @galipremsagarbytes_per_second
to distinct_count of stream_compaction nvbench. (#14172) @Blonckbytes_per_second
to transpose benchmark (#14170) @Blonckbytes_per_second
to shift benchmark (#13950) @Blonckdebug_utilities.hpp/cu
from column_utilities.hpp/cu
(#13720) @ttnghiaPublished by raydouglass 11 months ago
reindex
when index
is not unique (#14400) @galipremsagarcudf.pandas
profiler (#14514) @galipremsagarbranch-23.10
into branch-23.12
(#14435) @raydouglassreindex
when index
is not unique (#14400) @galipremsagar__getattr__
(#14388) @shwinacudf::merge
(#14345) @divyegaladask_cudf/io
pytests in CI (#14338) @galipremsagarrecover_with_nulls
is enabled (#14252) @elstehledeallocate
(#14251) @robertmaynardpynvjitlink
for CUDA 12+ MVC (#13650) @brandon-b-millertest_kurt_skew_error
(#14368) @galipremsagarnullable()
to has_nulls()
in cudf::detail::gather
(#14363) @divyegalaarrow
to 13
(#14330) @galipremsagarpyorc
dependency and use pandas
/pyarrow
instead (#14323) @galipremsagarpyarrow.fs
import for local storage (#14321) @rjzamoradask
and distributed
for 23.12
development (#14320) @galipremsagarsource_info
(#14308) @vuuleshared-action-workflows
references (#14289) @AyodeAwepartd
encode dispatch in dask_cudf
(#14287) @rjzamoracudf::merge
for nested types without using comparators (#14250) @divyegalacontains_column
by invoking contains_table
(#14238) @PointKernel23.10
into 23.12
(#14231) @galipremsagarbytes_per_second
to distinct_count of stream_compaction nvbench. (#14172) @Blonckbytes_per_second
to transpose benchmark (#14170) @Blonckbytes_per_second
to shift benchmark (#13950) @Blonckdebug_utilities.hpp/cu
from column_utilities.hpp/cu
(#13720) @ttnghiaPublished by raydouglass 11 months ago
reindex
when index
is not unique (#14429) @galipremsagarMixedTypeError
when a column of mixed-dtype is being constructed (#14050) @galipremsagarNotImplementedError
for MultiIndex.to_series
(#14049) @galipremsagarNA
to NaT
for datetime
and timedelta
types (#13868) @galipremsagarany
, all
reduction behavior for axis=None
and warn for other reductions (#13831) @galipremsagardatetime
and timedelta
types with other types (#13786) @galipremsagar23.10
(#13732) @galipremsagardatasource.hpp
public header (#13698) @vuulereindex
when index
is not unique (#14429) @galipremsagaraws-sdk-cpp<1.11
(#14173) @pentschevDataFrame
constructor (#14119) @galipremsagarDataFrame
's as input to DataFrame
constructor (#14118) @galipremsagarnumeric_only=True
for reduction operations on numeric types (#14111) @galipremsagarDataFrame
(#14110) @galipremsagarkwargs
from Series.count
(#14106) @galipremsagarIndex.to_frame
and MultiIndex.to_frame
APIs (#14105) @galipremsagar__round__
in Series
and DataFrame
(#14099) @galipremsagarSeries
and Index
(#14080) @galipremsagarconda mambabuild
rather than mamba mambabuild
(#14067) @wence-Index.intersection
(#14054) @galipremsagarIndex.difference
to match with pandas (#14053) @galipremsagarIntervalIndex.union
to preserve type-metadata (#14051) @galipremsagarMixedTypeError
when a column of mixed-dtype is being constructed (#14050) @galipremsagarNotImplementedError
for MultiIndex.to_series
(#14049) @galipremsagarsort_remaining
for sort_index
(#14033) @wence-quantile
(#14014) @galipremsagarMultiIndex.difference
(#14009) @galipremsagarpd.IntervalDtype
(#14006) @galipremsagarname
selection in Index.difference
and Index.intersection
(#13986) @galipremsagardropna
to fix factorize
API (#13980) @galipremsagarMultiIndex.to_numpy
to return numpy array with tuples (#13966) @galipremsagarIntervalIndex.repr
when null values are present (#13958) @galipremsagarColumn.unique
(#13957) @galipremsagarInterval
scalars when passed in list-like inputs to cudf.Index
(#13956) @galipremsagardtype
is passed to a CategoricalColumn
(#13955) @galipremsagaras_index
in GroupBy.apply
(#13951) @brandon-b-millernsmallest
and nlargest
(#13946) @galipremsagarindex
of Groupby.apply
results when it is performed on empty objects (#13944) @galipremsagardevice_sum
functions (#13943) @brandon-b-millerGrouping
objects (#13932) @galipremsagarloc
when column names is MultiIndex
(#13929) @galipremsagarsearchsorted
(#13925) @galipremsagarname
in reindex
(#13917) @galipremsagarcudf::thread_index_type
in cuIO to prevent overflow in row indexing (#13910) @vuulethread_index_type
to avoid index overflow in grid-stride loops (#13895) @PointKernelDataFrame
with mixed types (#13889) @galipremsagarnan
when one variable to be correlated has zero variance in JIT GroupBy Apply (#13884) @brandon-b-millerread_csv
with compressed input (#13881) @vuuleMultiIndex.isin
(#13879) @galipremsagarMultiIndex.levels
(#13870) @galipremsagarfreq
is set in pandas-compatibility mode (#13857) @galipremsagarNA
from a TimedeltaColumn
(#13853) @galipremsagarfreq
(#13844) @shwinaSeries
and Index
(#13842) @galipremsagartimezone-aware
data is passed to as_column
(#13835) @galipremsagarany
, all
reduction behavior for axis=None
and warn for other reductions (#13831) @galipremsagarDataFrame
constructor (#13811) @galipremsagardatetime
and timedelta
types with other types (#13786) @galipremsagarin
) method for Series
(#13779) @galipremsagarintegers_to_hex
(#14205) @razajafriHISTOGRAM
and MERGE_HISTOGRAM
aggregations (#14154) @ttnghiaGroupBy.value_counts
to match pandas API (#14114) @stmiohash_reduce_by_row
(#14095) @ttnghiaHISTOGRAM
and MERGE_HISTOGRAM
aggregations (#14045) @ttnghiaDataFrame.stack
to support multi level column names (#13927) @isVoiddatasource
to improve H2D throughput (#13814) @vuulecorr
in GroupBy.apply
through the jit engine (#13767) @shwinaGroupby.apply
with engine='jit'
(#13729) @brandon-b-millershared-action-workflows
references (backport from 23.12
to 23.10
) (#14300) @AyodeAwedask
and distributed
for 23.10
release (#14225) @galipremsagarRecently Updated
Check (#14193) @ajschmidt8Index
and raise warning in Series
constructor (#14116) @galipremsagarcontains_table
with cuco::static_set (#14064) @PointKernelobject
dtype will result in a failure (#13994) @galipremsagarcodes
dtype parity in pandas-compatibility mode for factorize
API (#13982) @galipremsagarCategoricalIndex
ordering in Groupby.agg
when pandas-compatibility mode is enabled (#13978) @galipremsagarthread_index_type
in partitioning.cu
(#13973) @divyegalacudf::thread_index_type
in merge.cu
(#13972) @divyegalacopy-pr-bot
(#13970) @ajschmidt8bytes_per_second
to hash_partition benchmark (#13965) @Blonckbytes_per_second
to copy_if_else benchmark (#13960) @BlonckSeries.unique
(#13959) @galipremsagarbytes_per_second
to compiled binaryop benchmark (#13938) @Blonckdask
and distributed
for 23.10
development (#13935) @galipremsagarempty()
instead of size()
where possible (#13908) @vuuleTimestamp
& Timedelta
for fetching scalars in DatetimeIndex
& TimedeltaIndex
(#13896) @galipremsagarshuffle="p2p"
within dask-cudf API (#13893) @rjzamoraDatetimeIndex
when freq
is passed to cudf.date_range
(#13890) @galipremsagardatetime
& timedelta
comparison operations (#13877) @galipremsagarNA
to NaT
for datetime
and timedelta
types (#13868) @galipremsagarastype(object)
is called in pandas compatibility mode (#13862) @galipremsagarlists::contains
to experimental row comparator (#13810) @divyegalalists::contains
dispatches for scalars (#13805) @divyegala23.10
(#13732) @galipremsagardatasource.hpp
public header (#13698) @vuulePublished by raydouglass 12 months ago
dask
and distributed
for release (#13070) @galipremsagarpandas
and numpy
pinnings (#12887) @galipremsagarnames
& dtype
in Index.copy
(#12825) @galipremsagarIndex.is_*
methods (#12820) @galipremsagardatetime_is_numeric
from describe
(#12818) @galipremsagarna_sentinel
in factorize
(#12817) @galipremsagarto_csv
(#12705) @wence-strings_udf
code into cuDF (#12669) @brandon-b-millerDataFrame
constructor to broadcast scalar inputs properly (#12997) @galipremsagarforce_nullable_schema
from chunked parquet writer (#12996) @galipremsagargroupby
(#12992) @galipremsagar__setitem__
on string columns when the scalar value ends in a null byte (#12991) @wence-sort_values
when column is all empty strings (#12988) @eriknwdask.dataframe.read_parquet
changes (#12983) @rjzamoraIndex.__getitem__
(#12955) @wence-RangeIndex
bug (#12838) @rjzamoraGroupBy.dtypes
(#12783) @galipremsagarnum_keys
in _scatter_by_slice
(#12749) @thomcomalways_nullable
flag to Dremel encoding (#12727) @divyegalaGroupBy.apply
(#12706) @brandon-b-millerto_csv
(#12705) @wence-round
API (#12670) @galipremsagarfrom_arrow
to load a sliced arrow table (#12665) @galipremsagarfind_common_dtype
and values
to handle complex dtypes (#12537) @galipremsagarSeries
comparison vs scalars (#12519) @brandon-b-millerUDFString
back to StringView
to call methods in strings_udf
(#12363) @brandon-b-millerGroupBy.apply
doc examples rendering (#12994) @brandon-b-millerpydata-sphinx-theme=0.13.0
compatible (#12874) @galipremsagarnullable
option application to single table writes (#12933) @vuuleio::orc::ProtobufWriter
(#12877) @ttnghiaGroupBy.apply
functions (#12802) @brandon-b-millertests/column_utilities
to use experimental::equality
row comparator (#12777) @divyegaladistinct/unique_count
to experimental::row
hasher/comparator (#12776) @divyegalahash_partition
to use experimental::row::row_hasher
(#12761) @divyegalais_sorted
to use experimental::row::lexicographic
(#12752) @divyegalaregex_program
strings splitting java APIs and tests (#12713) @cindyyuanjiangregex_program
strings replacing java APIs and tests (#12701) @cindyyuanjiangregex_program
strings extract java APIs and tests (#12699) @cindyyuanjiangstrings_udf
code into cuDF (#12669) @brandon-b-millerregex_program
searching APIs and related java classes (#12666) @cindyyuanjiangrank
to use to experimental row comparators (#12481) @divyegaladask
and distributed
for release (#13070) @galipremsagarforce_nullable_schema
parameter to Parquet writer. (#12952) @galipremsagargroupby.head
and groupby.tail
(#12939) @wence-SCCACHE_S3_USE_SSL
to conda builds (#12910) @ajschmidt8moto
env vars & pass AWS_SESSION_TOKEN
to conda builds (#12902) @ajschmidt8line_terminator
in favor of lineterminator
in to_csv
(#12896) @wence-stream
and mr
parameters for structs::detail::flatten_nested_columns
(#12892) @ttnghiapandas
and numpy
pinnings (#12887) @galipremsagargroupby.sample
(#12882) @wence-cudf::structs::detail::flatten_nested_columns
to smart pointer (#12878) @ttnghianames
& dtype
in Index.copy
(#12825) @galipremsagarinplace
parameters in categorical methods (#12824) @galipremsagarIndex.is_*
methods (#12820) @galipremsagardatetime_is_numeric
from describe
(#12818) @galipremsagarna_sentinel
in factorize
(#12817) @galipremsagarcudf::io::source_info
to use cudf::host_span<std::byte>
in a non-breaking form (#12730) @hyperbolic2346dask
and distributed
for development (#12710) @galipremsagarpartition_by_hash()
: use _split()
(#12704) @madsbkexperimental::row::equality
(#12676) @divyegalaconda
recipe (#12661) @ajschmidt8copy-on-write
feature branch into branch-23.04
(#12619) @galipremsagarPublished by raydouglass about 1 year ago
MixedTypeError
when a column of mixed-dtype is being constructed (#14050) @galipremsagarNotImplementedError
for MultiIndex.to_series
(#14049) @galipremsagarNA
to NaT
for datetime
and timedelta
types (#13868) @galipremsagarany
, all
reduction behavior for axis=None
and warn for other reductions (#13831) @galipremsagardatetime
and timedelta
types with other types (#13786) @galipremsagar23.10
(#13732) @galipremsagardatasource.hpp
public header (#13698) @vuuleaws-sdk-cpp<1.11
(#14173) @pentschevDataFrame
constructor (#14119) @galipremsagarDataFrame
's as input to DataFrame
constructor (#14118) @galipremsagarnumeric_only=True
for reduction operations on numeric types (#14111) @galipremsagarDataFrame
(#14110) @galipremsagarkwargs
from Series.count
(#14106) @galipremsagarIndex.to_frame
and MultiIndex.to_frame
APIs (#14105) @galipremsagar__round__
in Series
and DataFrame
(#14099) @galipremsagarSeries
and Index
(#14080) @galipremsagarconda mambabuild
rather than mamba mambabuild
(#14067) @wence-Index.intersection
(#14054) @galipremsagarIndex.difference
to match with pandas (#14053) @galipremsagarIntervalIndex.union
to preserve type-metadata (#14051) @galipremsagarMixedTypeError
when a column of mixed-dtype is being constructed (#14050) @galipremsagarNotImplementedError
for MultiIndex.to_series
(#14049) @galipremsagarsort_remaining
for sort_index
(#14033) @wence-quantile
(#14014) @galipremsagarMultiIndex.difference
(#14009) @galipremsagarpd.IntervalDtype
(#14006) @galipremsagarname
selection in Index.difference
and Index.intersection
(#13986) @galipremsagardropna
to fix factorize
API (#13980) @galipremsagarMultiIndex.to_numpy
to return numpy array with tuples (#13966) @galipremsagarIntervalIndex.repr
when null values are present (#13958) @galipremsagarColumn.unique
(#13957) @galipremsagarInterval
scalars when passed in list-like inputs to cudf.Index
(#13956) @galipremsagardtype
is passed to a CategoricalColumn
(#13955) @galipremsagaras_index
in GroupBy.apply
(#13951) @brandon-b-millernsmallest
and nlargest
(#13946) @galipremsagarindex
of Groupby.apply
results when it is performed on empty objects (#13944) @galipremsagardevice_sum
functions (#13943) @brandon-b-millerGrouping
objects (#13932) @galipremsagarloc
when column names is MultiIndex
(#13929) @galipremsagarsearchsorted
(#13925) @galipremsagarname
in reindex
(#13917) @galipremsagarcudf::thread_index_type
in cuIO to prevent overflow in row indexing (#13910) @vuulethread_index_type
to avoid index overflow in grid-stride loops (#13895) @PointKernelDataFrame
with mixed types (#13889) @galipremsagarnan
when one variable to be correlated has zero variance in JIT GroupBy Apply (#13884) @brandon-b-millerread_csv
with compressed input (#13881) @vuuleMultiIndex.isin
(#13879) @galipremsagarMultiIndex.levels
(#13870) @galipremsagarfreq
is set in pandas-compatibility mode (#13857) @galipremsagarNA
from a TimedeltaColumn
(#13853) @galipremsagarfreq
(#13844) @shwinaSeries
and Index
(#13842) @galipremsagartimezone-aware
data is passed to as_column
(#13835) @galipremsagarany
, all
reduction behavior for axis=None
and warn for other reductions (#13831) @galipremsagarDataFrame
constructor (#13811) @galipremsagardatetime
and timedelta
types with other types (#13786) @galipremsagarin
) method for Series
(#13779) @galipremsagarintegers_to_hex
(#14205) @razajafriHISTOGRAM
and MERGE_HISTOGRAM
aggregations (#14154) @ttnghiaGroupBy.value_counts
to match pandas API (#14114) @stmiohash_reduce_by_row
(#14095) @ttnghiaHISTOGRAM
and MERGE_HISTOGRAM
aggregations (#14045) @ttnghiaDataFrame.stack
to support multi level column names (#13927) @isVoiddatasource
to improve H2D throughput (#13814) @vuulecorr
in GroupBy.apply
through the jit engine (#13767) @shwinaGroupby.apply
with engine='jit'
(#13729) @brandon-b-millerdask
and distributed
for 23.10
release (#14225) @galipremsagarRecently Updated
Check (#14193) @ajschmidt8Index
and raise warning in Series
constructor (#14116) @galipremsagarcontains_table
with cuco::static_set (#14064) @PointKernelobject
dtype will result in a failure (#13994) @galipremsagarcodes
dtype parity in pandas-compatibility mode for factorize
API (#13982) @galipremsagarCategoricalIndex
ordering in Groupby.agg
when pandas-compatibility mode is enabled (#13978) @galipremsagarthread_index_type
in partitioning.cu
(#13973) @divyegalacudf::thread_index_type
in merge.cu
(#13972) @divyegalacopy-pr-bot
(#13970) @ajschmidt8bytes_per_second
to hash_partition benchmark (#13965) @Blonckbytes_per_second
to copy_if_else benchmark (#13960) @BlonckSeries.unique
(#13959) @galipremsagarbytes_per_second
to compiled binaryop benchmark (#13938) @Blonckdask
and distributed
for 23.10
development (#13935) @galipremsagarempty()
instead of size()
where possible (#13908) @vuuleTimestamp
& Timedelta
for fetching scalars in DatetimeIndex
& TimedeltaIndex
(#13896) @galipremsagarshuffle="p2p"
within dask-cudf API (#13893) @rjzamoraDatetimeIndex
when freq
is passed to cudf.date_range
(#13890) @galipremsagardatetime
& timedelta
comparison operations (#13877) @galipremsagarNA
to NaT
for datetime
and timedelta
types (#13868) @galipremsagarastype(object)
is called in pandas compatibility mode (#13862) @galipremsagarlists::contains
to experimental row comparator (#13810) @divyegalalists::contains
dispatches for scalars (#13805) @divyegala23.10
(#13732) @galipremsagardatasource.hpp
public header (#13698) @vuulePublished by raydouglass about 1 year ago
_get_decimal_type
(#13224) @charlesblucadask_cudf.read_parquet
(#13697) @rjzamoracuda-nvcc-impl
to cudf
for numba
CUDA 12 (#13673) @jakirkhamread_parquet
(#13666) @rjzamorathrust::identity
as hash functions for byte pair encoding (#13665) @PointKernelMultiIndex.loc
ordering & fix a bug in Groupby
with as_index
(#13657) @galipremsagarhas_nonempty_nulls
ignoring column offset (#13647) @ttnghiaIndex
search to simplify code and increase correctness (#13625) @wence-localize(None)
for timezone-naive columns (#13603) @shwinadask_cudf.read_csv
when lines are needed to be skipped (#13555) @galipremsagarsingle_lane_block_sum_reduce
is safe to call in a loop (#13488) @wence-_can_be_jitted
in GroupBy Apply (#13429) @brandon-b-miller_get_decimal_type
(#13224) @charlesblucamin
and max
aggregations for list type in groupby and reduction (#13676) @ttnghiacuco::static_map
to build string dictionaries in ORC writer (#13580) @vuulelists::concatenate_list_elements
(#13547) @ttnghialists::concatenate_list_elements
(#13545) @ttnghiafind_index_of_val
(#13517) @brandon-b-millerabs
function to apply (#13408) @brandon-b-millerstruct_minmax_util
to experimental row comparator (#13069) @divyegaladask
and distributed
for 23.08
release (#13802) @galipremsagar__dask_tokenize__
definitions to cudf classes (#13695) @rjzamoracuda::atomic_ref
(#13583) @PointKernelcudf.Dataframe
to/from pyarrow.Table
conversion (#13558) @rjzamoracuco::pair_type
aliases (#13553) @PointKernelsort=False
in Groupby
(#13551) @galipremsagarstrings_to_categorical
in cudf.read_parquet
(#13540) @galipremsagarhostdevice_*
copying APIs (#13530) @ttnghiacudf.option_context
(#13511) @galipremsagardask
and distributed
for development (#13508) @galipremsagarPublished by raydouglass over 1 year ago
observed=
argument in groupby to True
to reflect the actual behaviour (#13296) @shwinaNotImplementedError
when attempting to construct cuDF objects from timezone-aware datetimes (#13086) @shwinaarrow-11
(#12757) @galipremsagarIntervalIndex
constructor (#13394) @shwinafrom_column_view
(#13315) @wence-compute_column
(#13245) @wence-is_decompression_disabled
and is_compression_disabled
thread-safe (#13240) @vuulehostdevice_vector::subspan
(#13187) @ttnghiacudf::nvbench_base_fixture
usage (#13183) @robertmaynardSeries
and DataFrame
constructors to validate index lengths (#13122) @galipremsagarnull_count
of columns returned by chunked_parquet_reader
(#13111) @vuuleNotImplementedError
when attempting to construct cuDF objects from timezone-aware datetimes (#13086) @shwinaread_parquet
benchmarks (#13082) @vuuleskip_rows
in ORC and Parquet readers (#13063) @vuulecontains_table
to experimental row hasher and equality comparator (#13119) @divyegalaGroupBy.apply
algorithm based on if the UDF is jittable (#13113) @brandon-b-millermixed_join
to use experimental row hasher and comparator (#13028) @divyegalahostdevice_span
that is a span createable from hostdevice_vector
(#12981) @hyperbolic2346join
to use experimental row hasher and comparator (#12787) @divyegaladtype
parameter in get_dummies
(#13365) @galipremsagarStringIndex
and use Index
instead (#13361) @galipremsagarTypeVar(bound="Foo")
(#13346) @wence-distinct_count
with cuco::static_set
(#13343) @PointKernelcontiguous_split
performance (#13342) @ttnghiaread_parquet
(#13334) @rjzamorameta
calculation in dask_cudf.read_parquet
(#13327) @rjzamoradistinct_count
benchmark (#13321) @PointKernelobserved=
argument in groupby to True
to reflect the actual behaviour (#13296) @shwinaBaseDtype
when infering the data type of an arbitrary object (#13295) @shwinatransform_lists_of_structs
in row_operators.cu
(#13288) @ttnghiapartition
scatter_map
construction (#13272) @wence-pandas_compatible
option in cudf
(#13241) @galipremsagargpuDecideCompression
(#13202) @vuulecreate_serialized_trie
(#13201) @vuulepad
and backfill
methods (#13140) @galipremsagarUNKNOWN_NULL_COUNT
(#13134) @vyasrUNKNOWN_NULL_COUNT
(#13102) @vyasr.element()
instead of .data()
for window range calculations (#13095) @mythrocksbranch-23.04
to branch-23.06
(#13080) @galipremsagarcudf::detail::sorted_order
(#13062) @ttnghiabranch-23.04
to branch-23.06
(#13009) @galipremsagararrow-11
(#12757) @galipremsagarPublished by raydouglass over 1 year ago
observed=
argument in groupby to True
to reflect the actual behaviour (#13296) @shwinaNotImplementedError
when attempting to construct cuDF objects from timezone-aware datetimes (#13086) @shwinaarrow-11
(#12757) @galipremsagarIntervalIndex
constructor (#13394) @shwinafrom_column_view
(#13315) @wence-compute_column
(#13245) @wence-is_decompression_disabled
and is_compression_disabled
thread-safe (#13240) @vuulehostdevice_vector::subspan
(#13187) @ttnghiacudf::nvbench_base_fixture
usage (#13183) @robertmaynardSeries
and DataFrame
constructors to validate index lengths (#13122) @galipremsagarnull_count
of columns returned by chunked_parquet_reader
(#13111) @vuuleNotImplementedError
when attempting to construct cuDF objects from timezone-aware datetimes (#13086) @shwinaread_parquet
benchmarks (#13082) @vuuleskip_rows
in ORC and Parquet readers (#13063) @vuulecontains_table
to experimental row hasher and equality comparator (#13119) @divyegalaGroupBy.apply
algorithm based on if the UDF is jittable (#13113) @brandon-b-millermixed_join
to use experimental row hasher and comparator (#13028) @divyegalahostdevice_span
that is a span createable from hostdevice_vector
(#12981) @hyperbolic2346join
to use experimental row hasher and comparator (#12787) @divyegaladtype
parameter in get_dummies
(#13365) @galipremsagarStringIndex
and use Index
instead (#13361) @galipremsagarTypeVar(bound="Foo")
(#13346) @wence-distinct_count
with cuco::static_set
(#13343) @PointKernelcontiguous_split
performance (#13342) @ttnghiaread_parquet
(#13334) @rjzamorameta
calculation in dask_cudf.read_parquet
(#13327) @rjzamoradistinct_count
benchmark (#13321) @PointKernelobserved=
argument in groupby to True
to reflect the actual behaviour (#13296) @shwinaBaseDtype
when infering the data type of an arbitrary object (#13295) @shwinatransform_lists_of_structs
in row_operators.cu
(#13288) @ttnghiapartition
scatter_map
construction (#13272) @wence-pandas_compatible
option in cudf
(#13241) @galipremsagargpuDecideCompression
(#13202) @vuulecreate_serialized_trie
(#13201) @vuulepad
and backfill
methods (#13140) @galipremsagarUNKNOWN_NULL_COUNT
(#13134) @vyasrUNKNOWN_NULL_COUNT
(#13102) @vyasr.element()
instead of .data()
for window range calculations (#13095) @mythrocksbranch-23.04
to branch-23.06
(#13080) @galipremsagarcudf::detail::sorted_order
(#13062) @ttnghiabranch-23.04
to branch-23.06
(#13009) @galipremsagararrow-11
(#12757) @galipremsagarPublished by raydouglass over 1 year ago
dask
and distributed
for release (#13070) @galipremsagarpandas
and numpy
pinnings (#12887) @galipremsagarnames
& dtype
in Index.copy
(#12825) @galipremsagarIndex.is_*
methods (#12820) @galipremsagardatetime_is_numeric
from describe
(#12818) @galipremsagarna_sentinel
in factorize
(#12817) @galipremsagarto_csv
(#12705) @wence-strings_udf
code into cuDF (#12669) @brandon-b-millerDataFrame
constructor to broadcast scalar inputs properly (#12997) @galipremsagarforce_nullable_schema
from chunked parquet writer (#12996) @galipremsagargroupby
(#12992) @galipremsagar__setitem__
on string columns when the scalar value ends in a null byte (#12991) @wence-sort_values
when column is all empty strings (#12988) @eriknwdask.dataframe.read_parquet
changes (#12983) @rjzamoraIndex.__getitem__
(#12955) @wence-RangeIndex
bug (#12838) @rjzamoraGroupBy.dtypes
(#12783) @galipremsagarnum_keys
in _scatter_by_slice
(#12749) @thomcomalways_nullable
flag to Dremel encoding (#12727) @divyegalaGroupBy.apply
(#12706) @brandon-b-millerto_csv
(#12705) @wence-round
API (#12670) @galipremsagarfrom_arrow
to load a sliced arrow table (#12665) @galipremsagarfind_common_dtype
and values
to handle complex dtypes (#12537) @galipremsagarSeries
comparison vs scalars (#12519) @brandon-b-millerUDFString
back to StringView
to call methods in strings_udf
(#12363) @brandon-b-millerGroupBy.apply
doc examples rendering (#12994) @brandon-b-millerpydata-sphinx-theme=0.13.0
compatible (#12874) @galipremsagarnullable
option application to single table writes (#12933) @vuuleio::orc::ProtobufWriter
(#12877) @ttnghiaGroupBy.apply
functions (#12802) @brandon-b-millertests/column_utilities
to use experimental::equality
row comparator (#12777) @divyegaladistinct/unique_count
to experimental::row
hasher/comparator (#12776) @divyegalahash_partition
to use experimental::row::row_hasher
(#12761) @divyegalais_sorted
to use experimental::row::lexicographic
(#12752) @divyegalaregex_program
strings splitting java APIs and tests (#12713) @cindyyuanjiangregex_program
strings replacing java APIs and tests (#12701) @cindyyuanjiangregex_program
strings extract java APIs and tests (#12699) @cindyyuanjiangstrings_udf
code into cuDF (#12669) @brandon-b-millerregex_program
searching APIs and related java classes (#12666) @cindyyuanjiangrank
to use to experimental row comparators (#12481) @divyegaladask
and distributed
for release (#13070) @galipremsagarforce_nullable_schema
parameter to Parquet writer. (#12952) @galipremsagargroupby.head
and groupby.tail
(#12939) @wence-SCCACHE_S3_USE_SSL
to conda builds (#12910) @ajschmidt8moto
env vars & pass AWS_SESSION_TOKEN
to conda builds (#12902) @ajschmidt8line_terminator
in favor of lineterminator
in to_csv
(#12896) @wence-stream
and mr
parameters for structs::detail::flatten_nested_columns
(#12892) @ttnghiapandas
and numpy
pinnings (#12887) @galipremsagargroupby.sample
(#12882) @wence-cudf::structs::detail::flatten_nested_columns
to smart pointer (#12878) @ttnghianames
& dtype
in Index.copy
(#12825) @galipremsagarinplace
parameters in categorical methods (#12824) @galipremsagarIndex.is_*
methods (#12820) @galipremsagardatetime_is_numeric
from describe
(#12818) @galipremsagarna_sentinel
in factorize
(#12817) @galipremsagarcudf::io::source_info
to use cudf::host_span<std::byte>
in a non-breaking form (#12730) @hyperbolic2346dask
and distributed
for development (#12710) @galipremsagarpartition_by_hash()
: use _split()
(#12704) @madsbkexperimental::row::equality
(#12676) @divyegalaconda
recipe (#12661) @ajschmidt8copy-on-write
feature branch into branch-23.04
(#12619) @galipremsagarPublished by raydouglass over 1 year ago
dask
and distributed
for release (#12695) @galipremsagarptr
in Buffer
(#12587) @galipremsagarcudf::io::read_json
to nested JSON parser (#12544) @vuuleengine=cudf
to the new JSON
reader (#12509) @galipremsagararrow-10.0.1
(#12327) @galipremsagarread_orc
(#12325) @vuulesuperimpose_nulls
and push_down_nulls
(#12239) @ttnghiacudf::structs::detail::superimpose_parent_nulls
APIs (#12230) @ttnghiaget_next_chunk()
should use the blocking function device_read()
(#12584) @madsbkpartition_by_hash()
: support index (#12554) @madsbkRAPIDS
magic value (#12525) @robertmaynardListColumn
(#12487) @galipremsagarcudf::make_lists_column
(#12370) @ttnghiaread_orc
(#12325) @vuulejson_chunked_reader.cpp
(#12280) @ttnghiaconcat
for singleton concatenations (#12247) @wence-superimpose_nulls
and push_down_nulls
(#12239) @ttnghiastrings_udf
doc update (#12469) @brandon-b-millerhas_nonempty_nulls
instead of may_contain_non_empty_nulls
in superimpose_nulls
and push_down_nulls
(#12560) @ttnghiaregex_program
java APIs and unit tests (#12548) @cindyyuanjiangcudf::io::read_json
to nested JSON parser (#12544) @vuuleone_hot_encode
to use experimental row comparators (#12478) @divyegalathrust_copy_if
into cudf::detail::copy_if_safe
(#12455) @ttnghiatokenize_json.hpp
detail header from src/io/json/nested_json.hpp
(#12432) @ttnghialists::reverse
(#12336) @ttnghiadevice_read
in experimental read_json
(#12314) @vuulestrings::reverse
(#12283) @ttnghiahostdevice_vector
(#12251) @vuulereplace
in strings_udf
(#12207) @brandon-b-millerstream_compaction/unique
to use new row comparators (#12159) @divyegaladask
and distributed
for release (#12695) @galipremsagarlibcudf-example
to Anaconda.org (#12671) @ajschmidt8ptr
in Buffer
(#12587) @galipremsagarcategory
dtypes in CSV reader (#12571) @galipremsagarspill_lock
parameter from SpillableBuffer.get_ptr()
(#12564) @madsbkcudf::make_lists_column
(#12547) @ttnghiacudf::strings::repeat_strings_output_sizes
from Java and JNI (#12546) @ttnghia@acquire_spill_lock()
and as_buffer(..., exposed=False)
(#12535) @madsbkengine=cudf
to the new JSON
reader (#12509) @galipremsagarint32
hard-coding in python (#12467) @galipremsagar11.8
and Python 3.10
Packages (#12457) @ajschmidt8Recently Updated
Check (#12435) @ajschmidt8ops-bot.yaml
(#12402) @ajschmidt8numpy.empty()
instead of bytearray
to allocate host memory for spilling (#12399) @madsbkarrow-10.0.1
(#12327) @galipremsagartest_notebooks.sh
(#12323) @ajschmidt8check_style.sh
script (#12320) @ajschmidt8dask
and distributed
for development (#12302) @galipremsagarsccache
for Jenkins builds (#12297) @ajschmidt8Index.to_pandas
(#12268) @galipremsagarduplicated
support for Series
, DataFrame
and Index
(#12246) @galipremsagar@acquire_spill_lock()
(#12232) @madsbkcudf::structs::detail::superimpose_parent_nulls
APIs (#12230) @ttnghiaread_json
tests (#12229) @vuuletest_spilling.py
(#12220) @madsbkSeries.hasnans
and Index.hasnans
(#12214) @galipremsagar_label_encoding
from Series to Column (#12040) @shwinagroupby_agg
calls in one place (#10835) @charlesblucaPublished by GPUtester almost 2 years ago
substring
without 'end' parameter. (#12113) @firestarmanpurge_nonempty_nulls
(#12111) @ttnghiaint8
column in read_csv
when all elements are missing (#12110) @vuuleLIBCUDF_CUFILE_POLICY
is set to "ALWAYS"
(#12080) @vuule**kwargs
and *args
in IO
readers & writers (#12025) @galipremsagarDeviceBufferLike
(#12009) @madsbkmanaged_allocator
(#12005) @vyasrwrite_csv
instead of table_metadata
pointer (#11972) @vuulefrom_dict
backend dispatch to match upstream dask
(#12203) @galipremsagarudf_string
pointers in strings_udf
(#12138) @brandon-b-millersubstring
without 'end' parameter. (#12113) @firestarmanGzipFile
type (#12085) @galipremsagar__setitem__
edge case (#12075) @wence-shim_60.ptx
by default in strings_udf
(#12056) @brandon-b-millermemory_usage
& items
implementation for Struct
column & dtype (#12033) @galipremsagar**kwargs
and *args
in IO
readers & writers (#12025) @galipremsagarusecols
and names
options are used in read_csv
(#12018) @vuuleconstexpr
" (#11999) @vuuledf.loc
resulting in single row could give wrong index (#11998) @eriknwtruncate
API to python doc pages (#12109) @galipremsagarCONTRIBUTING.md
(#12084) @brandon-b-millercudf
specific dtypes (#11974) @galipremsagar+
in strings_udf
(#12117) @brandon-b-millerupper
and lower
in strings_udf
(#12099) @brandon-b-millerstrip
, lstrip
, and rstrip
in strings_udf
(#12091) @brandon-b-millerlike
jni and native method (#12032) @cindyyuanjiangMaskedType
(#11987) @brandon-b-millerapply
(#11933) @brandon-b-millerstrings_udf
(#11884) @brandon-b-millerread_orc_metadata
to libcudf (#11815) @vuulespilling
(#12197) @madsbkdask
and distributed
for release (#12165) @galipremsagarpurge_nonempty_nulls
(#12111) @ttnghiaint8
column in read_csv
when all elements are missing (#12110) @vuulepd.read_orc
changes in tests (#12103) @galipremsagardask
nigthly install in CI (#12082) @galipremsagarLIBCUDF_CUFILE_POLICY
is set to "ALWAYS"
(#12080) @vuuleDataFrame.from_dict
`to_dictand
Series.to_dict` (#12048) @galipremsagarDeviceBufferLike
(#12009) @madsbkmanaged_allocator
(#12005) @vyasrconstexpr
(#11980) @vuulewrite_csv
instead of table_metadata
pointer (#11972) @vuulestrip_delimiters
option to read_text
(#11946) @upsjoutput_builder
(#11945) @upsj.str.find_multiple
API (#11928) @galipremsagarnanosecond
& microsecond
to DatetimeProperties
(#11911) @galipremsagarListColumn
or any new unsupported column in cudf.Index
(#11902) @galipremsagarcodecov
threshold diff (#11899) @galipremsagardata_chunk_source
wrapper for datasource
(#11886) @upsjmultibyte_split
byte_range out-of-bounds offsets on host (#11885) @upsjdask
and distributed
for development (#11859) @galipremsagarpyorc
(#11855) @jakirkhamcudf_io
namespace alias (#11827) @vuuleread_text
(#11802) @upsjnvcc
warnings into errors (#8916) @trxcllntPublished by GPUtester almost 2 years ago
substring
without 'end' parameter. (#12113) @firestarmanpurge_nonempty_nulls
(#12111) @ttnghiaint8
column in read_csv
when all elements are missing (#12110) @vuuleLIBCUDF_CUFILE_POLICY
is set to "ALWAYS"
(#12080) @vuule**kwargs
and *args
in IO
readers & writers (#12025) @galipremsagarDeviceBufferLike
(#12009) @madsbkmanaged_allocator
(#12005) @vyasrwrite_csv
instead of table_metadata
pointer (#11972) @vuulefrom_dict
backend dispatch to match upstream dask
(#12203) @galipremsagarudf_string
pointers in strings_udf
(#12138) @brandon-b-millersubstring
without 'end' parameter. (#12113) @firestarmanGzipFile
type (#12085) @galipremsagar__setitem__
edge case (#12075) @wence-shim_60.ptx
by default in strings_udf
(#12056) @brandon-b-millermemory_usage
& items
implementation for Struct
column & dtype (#12033) @galipremsagar**kwargs
and *args
in IO
readers & writers (#12025) @galipremsagarusecols
and names
options are used in read_csv
(#12018) @vuuleconstexpr
" (#11999) @vuuledf.loc
resulting in single row could give wrong index (#11998) @eriknwtruncate
API to python doc pages (#12109) @galipremsagarCONTRIBUTING.md
(#12084) @brandon-b-millercudf
specific dtypes (#11974) @galipremsagar+
in strings_udf
(#12117) @brandon-b-millerupper
and lower
in strings_udf
(#12099) @brandon-b-millerstrip
, lstrip
, and rstrip
in strings_udf
(#12091) @brandon-b-millerlike
jni and native method (#12032) @cindyyuanjiangMaskedType
(#11987) @brandon-b-millerapply
(#11933) @brandon-b-millerstrings_udf
(#11884) @brandon-b-millerread_orc_metadata
to libcudf (#11815) @vuulespilling
(#12197) @madsbkdask
and distributed
for release (#12165) @galipremsagarpurge_nonempty_nulls
(#12111) @ttnghiaint8
column in read_csv
when all elements are missing (#12110) @vuulepd.read_orc
changes in tests (#12103) @galipremsagardask
nigthly install in CI (#12082) @galipremsagarLIBCUDF_CUFILE_POLICY
is set to "ALWAYS"
(#12080) @vuuleDataFrame.from_dict
`to_dictand
Series.to_dict` (#12048) @galipremsagarDeviceBufferLike
(#12009) @madsbkmanaged_allocator
(#12005) @vyasrconstexpr
(#11980) @vuulewrite_csv
instead of table_metadata
pointer (#11972) @vuulestrip_delimiters
option to read_text
(#11946) @upsjoutput_builder
(#11945) @upsj.str.find_multiple
API (#11928) @galipremsagarnanosecond
& microsecond
to DatetimeProperties
(#11911) @galipremsagarListColumn
or any new unsupported column in cudf.Index
(#11902) @galipremsagarcodecov
threshold diff (#11899) @galipremsagardata_chunk_source
wrapper for datasource
(#11886) @upsjmultibyte_split
byte_range out-of-bounds offsets on host (#11885) @upsjdask
and distributed
for development (#11859) @galipremsagarpyorc
(#11855) @jakirkhamcudf_io
namespace alias (#11827) @vuuleread_text
(#11802) @upsjnvcc
warnings into errors (#8916) @trxcllntPublished by GPUtester almost 2 years ago
Index.isna
& Index.notna
(#11769) @galipremsagarkwargs
in read_csv
& to_csv
(#11762) @galipremsagarcudf::partition*
APIs that do not return offsets for empty output table (#11709) @ttnghiapandas
to 1.5
(#11617) @galipremsagarordered
to False
in CategoricalDtype
(#11604) @galipremsagarskiprows
and num_rows
in read_orc
(#11522) @galipremsagarskiprows
and num_rows
in cudf.read_parquet
(#11480) @galipremsagarBuffer
class (#11447) @madsbkcolumns
option (#11446) @vuuleptx
file paths during strings_udf
import (#11862) @galipremsagarstrings_udf
CEC and solve several related issues (#11846) @brandon-b-milleris_valid
checks in Scalar._binaryop
(#11818) @wence-NotImplemented
issue with numpy
(#11816) @galipremsagarstrings_udf
package with other python packages in nightlies (#11808) @brandon-b-millercudf
locally before building strings_udf
conda packages in CI (#11785) @brandon-b-millerIndex.isna
& Index.notna
(#11769) @galipremsagarlist
and struct
types (#11760) @galipremsagarstrings_udf
package for python 3.9 (#11730) @brandon-b-millercudf::partition*
APIs that do not return offsets for empty output table (#11709) @ttnghiaDataFrame.from_arrow
to preserve type metadata (#11698) @galipremsagarto_orc
when using cuDF or Dask (#11690) @vuuleMultindex
when group_keys=True
(#11689) @galipremsagarmypy
checks (#11685) @galipremsagar.loc
(#11677) @shwinadevice_write()
: it uses an incorrect size (#11651) @madsbkordered
to False
in CategoricalDtype
(#11604) @galipremsagarto_arrow
when column name type is not a string (#11590) @galipremsagarquantiles
benchmark (#11584) @vuuledask
& distributed
nightly/stable installs (#11565) @galipremsagarskiprows
& num_rows
(#11505) @galipremsagarcolumns
option (#11446) @vuuleguide-to-udfs
notebook (#11861) @brandon-b-millerlist
& struct
handling (#11770) @galipremsagarDataFrame.apply
and Series.apply
and update guide to UDFs notebook (#11733) @brandon-b-millercudf.get_dummies
docstring (#11443) @galipremsagaristitle
to string UDFs (#11738) @brandon-b-millerdata_chunk_reader
(#11652) @upsjcontiguousSplitGroups
(#11614) @res-lifegdb
pretty-printers for simple types (#11499) @upsjcreate_random_column
function to the data generator (#11490) @vuuledata_profile
(#11479) @vuuleDataFrame.apply
and Series.apply
(#11319) @brandon-b-millercudf::contains
(#10656) @ttnghialists::contains
(#10548) @ttnghiadask
and distributed
for release (#11822) @galipremsagarkwargs
in read_csv
& to_csv
(#11762) @galipremsagardtype
param to avoid pd.Series
warnings (#11761) @galipremsagarschema_element
& keep_quotes
support in json reader (#11746) @galipremsagarListColumn
when size is None
(#11745) @galipremsagar__dataframe__
method to the protocol dataframe object (#11692) @rgommersgroup_keys
in groupby
(#11659) @galipremsagarnullCount
in get_json_object
(#11633) @trxcllnthostdevice_vector
(#11631) @upsjpandas
to 1.5
(#11617) @galipremsagarif constexpr
(#11581) @ttnghialist(struct)
columns as map
type in orc writer (#11568) @galipremsagarskiprows
and num_rows
in read_orc
(#11522) @galipremsagarNULL_EQUALS
binary operation (#11520) @rwleearrow-9.x
(#11507) @galipremsagarmultibyte_split
(#11500) @upsjdask
and distributed
for development (#11492) @galipremsagarskiprows
and num_rows
in cudf.read_parquet
(#11480) @galipremsagardistinct_count
benchmark (#11473) @ttnghianunique
aggregation benchmark (#11472) @ttnghiamax
aggregation benchmark (#11464) @ttnghiaBuffer
class (#11447) @madsbkPublished by GPUtester about 2 years ago
Index.isna
& Index.notna
(#11769) @galipremsagarkwargs
in read_csv
& to_csv
(#11762) @galipremsagarcudf::partition*
APIs that do not return offsets for empty output table (#11709) @ttnghiapandas
to 1.5
(#11617) @galipremsagarordered
to False
in CategoricalDtype
(#11604) @galipremsagarskiprows
and num_rows
in read_orc
(#11522) @galipremsagarskiprows
and num_rows
in cudf.read_parquet
(#11480) @galipremsagarBuffer
class (#11447) @madsbkcolumns
option (#11446) @vuuleptx
file paths during strings_udf
import (#11862) @galipremsagarstrings_udf
CEC and solve several related issues (#11846) @brandon-b-milleris_valid
checks in Scalar._binaryop
(#11818) @wence-NotImplemented
issue with numpy
(#11816) @galipremsagarstrings_udf
package with other python packages in nightlies (#11808) @brandon-b-millercudf
locally before building strings_udf
conda packages in CI (#11785) @brandon-b-millerIndex.isna
& Index.notna
(#11769) @galipremsagarlist
and struct
types (#11760) @galipremsagarstrings_udf
package for python 3.9 (#11730) @brandon-b-millercudf::partition*
APIs that do not return offsets for empty output table (#11709) @ttnghiaDataFrame.from_arrow
to preserve type metadata (#11698) @galipremsagarto_orc
when using cuDF or Dask (#11690) @vuuleMultindex
when group_keys=True
(#11689) @galipremsagarmypy
checks (#11685) @galipremsagar.loc
(#11677) @shwinadevice_write()
: it uses an incorrect size (#11651) @madsbkordered
to False
in CategoricalDtype
(#11604) @galipremsagarto_arrow
when column name type is not a string (#11590) @galipremsagarquantiles
benchmark (#11584) @vuuledask
& distributed
nightly/stable installs (#11565) @galipremsagarskiprows
& num_rows
(#11505) @galipremsagarcolumns
option (#11446) @vuuleguide-to-udfs
notebook (#11861) @brandon-b-millerlist
& struct
handling (#11770) @galipremsagarDataFrame.apply
and Series.apply
and update guide to UDFs notebook (#11733) @brandon-b-millercudf.get_dummies
docstring (#11443) @galipremsagaristitle
to string UDFs (#11738) @brandon-b-millerdata_chunk_reader
(#11652) @upsjcontiguousSplitGroups
(#11614) @res-lifegdb
pretty-printers for simple types (#11499) @upsjcreate_random_column
function to the data generator (#11490) @vuuledata_profile
(#11479) @vuuleDataFrame.apply
and Series.apply
(#11319) @brandon-b-millercudf::contains
(#10656) @ttnghialists::contains
(#10548) @ttnghiadask
and distributed
for release (#11822) @galipremsagarkwargs
in read_csv
& to_csv
(#11762) @galipremsagardtype
param to avoid pd.Series
warnings (#11761) @galipremsagarschema_element
& keep_quotes
support in json reader (#11746) @galipremsagarListColumn
when size is None
(#11745) @galipremsagar__dataframe__
method to the protocol dataframe object (#11692) @rgommersgroup_keys
in groupby
(#11659) @galipremsagarnullCount
in get_json_object
(#11633) @trxcllnthostdevice_vector
(#11631) @upsjpandas
to 1.5
(#11617) @galipremsagarif constexpr
(#11581) @ttnghialist(struct)
columns as map
type in orc writer (#11568) @galipremsagarskiprows
and num_rows
in read_orc
(#11522) @galipremsagarNULL_EQUALS
binary operation (#11520) @rwleearrow-9.x
(#11507) @galipremsagarmultibyte_split
(#11500) @upsjdask
and distributed
for development (#11492) @galipremsagarskiprows
and num_rows
in cudf.read_parquet
(#11480) @galipremsagardistinct_count
benchmark (#11473) @ttnghianunique
aggregation benchmark (#11472) @ttnghiamax
aggregation benchmark (#11464) @ttnghiaBuffer
class (#11447) @madsbkPublished by GPUtester about 2 years ago
numpy
to <1.23
(#11824) @galipremsagarlists::drop_list_duplicates
(#11236) @ttnghia3.7
in code-base (#11029) @galipremsagarcolumns
option (#11018) @vuule.ptr
read-only (#10872) @madsbkcudf::detail::label_segments
(#11497) @ttnghiadistributed
error related to loop_in_thread
(#11428) @galipremsagarmoto
timeouts (#11369) @galipremsagar+/-infinity
as the identity
values for floating-point numbers in device operators min
and max
(#11357) @ttnghiaListSeries
(#11355) @thomcomDatetimeIndex
& TimedeltaIndex
constructors (#11342) @galipremsagarcudf::detail::contains
(#11330) @ttnghiapytorch
to temporarily unblock from libcupti
errors (#11289) @galipremsagarcudf::detail::contains
(#11284) @ttnghiacategory
dtype (#11282) @galipremsagartests/rolling/empty_input_test
(#11238) @ttnghiahost_span<bitmask_type const*>
(#11220) @ttnghianvcompBatchedDeflateDecompressGetTempSizeEx
in cuIO (#11213) @vuulecudf.from_pandas
(#11142) @galipremsagar__cuda_array_interface__
failures (#11113) @galipremsagarread_parquet
(#11105) @galipremsagarcudatoolkit
pinning issue (#11070) @galipremsagarget_current_device_resource()
(#11047) @AtlantaPepsicolumns
option (#11018) @vuulecustom.js
(#11465) @galipremsagarwhere
docs (#11276) @brandon-b-millerget_row_data_range
(#11271) @vyasrlanguage
from None
to "en"
in docs build (#11133) @galipremsagarDataFrame.value_counts
(#11039) @galipremsagarcollect_set
to use cudf::distinct
and cudf::lists::distinct
(#11228) @ttnghiacudf.options
(#11193) @isVoidnth_element
for window functions (#11158) @mythrockslists::distinct
and cudf::detail::stable_distinct
(#11149) @ttnghianan_equality
in cudf::distinct
(#11118) @ttnghiasemi_anti_join
(#11100) @ttnghiaduplicate_keep_option
in cudf::distinct
(#11052) @ttnghianumpy
to <1.23
(#11824) @galipremsagardask
& distributed
for release (#11433) @galipremsagardoxygen
(#11430) @galipremsagarcudf
recipes syntax (#11273) @ajschmidt8cudf
recipe (#11267) @ajschmidt8lists::drop_list_duplicates
(#11236) @ttnghiacudf::lists::distinct
in Python binding (#11234) @ttnghiacudf::lists::distinct
in Java binding (#11233) @ttnghiacudf::distinct
in Java binding (#11232) @ttnghiadask-cuda
in dev environment (#11229) @galipremsagarskiprows
& num_rows
in parquet reader (#11218) @galipremsagarcudf::contains
when searching for a scalar (#11202) @ttnghiacuda-python
to 11.7.0
(#11174) @Ethylingnvcc
conda package in dev environment (#11154) @galipremsagar_from_data_like_self
factory (#11140) @vyasrskbuild
directory into build.sh
clean
operation (#11060) @galipremsagardask
& distributed
for development (#11058) @galipremsagarSeries.between
(#11051) @galipremsagar3.7
in code-base (#11029) @galipremsagarlists::contains
(#11019) @ttnghiaiterator.cuh
and add fixed point support for scalar_optional_accessor
(#10999) @ttnghiacudf::contains
(#10997) @ttnghiadistinct
by using cuco::static_map::retrieve_all
(#10916) @PointKernel.ptr
read-only (#10872) @madsbkcudf.core.groupby.Grouper
objects to dask grouper_dispatch
(#10838) @brandon-b-millerarrow-8
(#10816) @galipremsagarPublished by GPUtester about 2 years ago
lists::drop_list_duplicates
(#11236) @ttnghia3.7
in code-base (#11029) @galipremsagarcolumns
option (#11018) @vuule.ptr
read-only (#10872) @madsbkdistributed
error related to loop_in_thread
(#11428) @galipremsagarmoto
timeouts (#11369) @galipremsagar+/-infinity
as the identity
values for floating-point numbers in device operators min
and max
(#11357) @ttnghiaListSeries
(#11355) @thomcomDatetimeIndex
& TimedeltaIndex
constructors (#11342) @galipremsagarcudf::detail::contains
(#11330) @ttnghiapytorch
to temporarily unblock from libcupti
errors (#11289) @galipremsagarcudf::detail::contains
(#11284) @ttnghiacategory
dtype (#11282) @galipremsagartests/rolling/empty_input_test
(#11238) @ttnghiahost_span<bitmask_type const*>
(#11220) @ttnghianvcompBatchedDeflateDecompressGetTempSizeEx
in cuIO (#11213) @vuulecudf.from_pandas
(#11142) @galipremsagar__cuda_array_interface__
failures (#11113) @galipremsagarread_parquet
(#11105) @galipremsagarcudatoolkit
pinning issue (#11070) @galipremsagarget_current_device_resource()
(#11047) @AtlantaPepsicolumns
option (#11018) @vuulewhere
docs (#11276) @brandon-b-millerget_row_data_range
(#11271) @vyasrlanguage
from None
to "en"
in docs build (#11133) @galipremsagarDataFrame.value_counts
(#11039) @galipremsagarcollect_set
to use cudf::distinct
and cudf::lists::distinct
(#11228) @ttnghiacudf.options
(#11193) @isVoidnth_element
for window functions (#11158) @mythrockslists::distinct
and cudf::detail::stable_distinct
(#11149) @ttnghianan_equality
in cudf::distinct
(#11118) @ttnghiasemi_anti_join
(#11100) @ttnghiaduplicate_keep_option
in cudf::distinct
(#11052) @ttnghiadask
& distributed
for release (#11433) @galipremsagardoxygen
(#11430) @galipremsagarcudf
recipes syntax (#11273) @ajschmidt8cudf
recipe (#11267) @ajschmidt8lists::drop_list_duplicates
(#11236) @ttnghiacudf::lists::distinct
in Python binding (#11234) @ttnghiacudf::lists::distinct
in Java binding (#11233) @ttnghiacudf::distinct
in Java binding (#11232) @ttnghiadask-cuda
in dev environment (#11229) @galipremsagarskiprows
& num_rows
in parquet reader (#11218) @galipremsagarcudf::contains
when searching for a scalar (#11202) @ttnghiacuda-python
to 11.7.0
(#11174) @Ethylingnvcc
conda package in dev environment (#11154) @galipremsagar_from_data_like_self
factory (#11140) @vyasrskbuild
directory into build.sh
clean
operation (#11060) @galipremsagardask
& distributed
for development (#11058) @galipremsagarSeries.between
(#11051) @galipremsagar3.7
in code-base (#11029) @galipremsagarlists::contains
(#11019) @ttnghiaiterator.cuh
and add fixed point support for scalar_optional_accessor
(#10999) @ttnghiacudf::contains
(#10997) @ttnghiadistinct
by using cuco::static_map::retrieve_all
(#10916) @PointKernel.ptr
read-only (#10872) @madsbkcudf.core.groupby.Grouper
objects to dask grouper_dispatch
(#10838) @brandon-b-millerarrow-8
(#10816) @galipremsagar