Fix: XGBoost Not Working — Feature Name Mismatch, GPU Errors, and Early Stopping Changes
Quick Answer
How to fix XGBoost errors — feature names mismatch, XGBoostError GPU training fails, use_label_encoder deprecated, eval_metric warning, early stopping moved to callback, ValueError for DMatrix, and sklearn API confusion.
The Error
You train a model and predict on new data — XGBoost refuses:
ValueError: feature_names mismatch: ['age', 'salary'] ['salary', 'age']
Training data did not have the following fields: salary
Or GPU training crashes immediately:
XGBoostError: [13:45:20] ../src/common/device_helpers.cuh:431: CUDA runtime error
Or you get deprecation warnings that fill the console:
WARNING: use_label_encoder is deprecated in 1.6 and will be removed in a future release.
WARNING: Parameters: { "silent" } are not used.
Or early stopping doesn’t work the way it used to:
TypeError: fit() got an unexpected keyword argument 'early_stopping_rounds'
XGBoost moves fast — the API changed significantly between 1.x and 2.x. Deprecated parameters, renamed arguments, and a restructured callback system break working code silently or loudly depending on which version you upgrade to. This guide covers each breaking change and the fix.
Why This Happens
XGBoost wraps a C++ core library with Python bindings. The Python API has two interfaces: the native Learning API (xgb.train(), DMatrix) and the scikit-learn compatible API (XGBClassifier, XGBRegressor). Both interfaces expose the same underlying engine but handle parameters differently — and both have changed between major versions.
Feature name validation was added in XGBoost 1.4. GPU support requires CUDA-compiled binaries. Early stopping moved from a constructor parameter to a callback in 2.0. Understanding which version you’re on is the first step to fixing any XGBoost error.
Fix 1: Feature Name Mismatch
ValueError: feature_names mismatch: ['f0', 'f1', 'f2'] ['age', 'salary', 'score']
XGBoost 1.4+ validates that prediction data has the same feature names and order as training data. This is strict — even reordered columns fail.
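A defensive check before calling predict() makes the failure mode obvious. A minimal pure-Python sketch of the same validation XGBoost performs; the align_columns helper is ours, not an XGBoost API:

```python
def align_columns(test_cols, train_cols):
    """Return the positions that reorder test_cols into train_cols order.

    Raises if any training column is missing, mirroring XGBoost's own check.
    """
    missing = [c for c in train_cols if c not in test_cols]
    if missing:
        raise ValueError(f"prediction data is missing columns: {missing}")
    return [test_cols.index(c) for c in train_cols]

# Training used ['age', 'salary', 'score']; test data arrived reordered
print(align_columns(['salary', 'age', 'score'], ['age', 'salary', 'score']))  # [1, 0, 2]
```

With pandas the same fix is a one-liner, X_test[X_train.columns], which reorders in place.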
Common causes and fixes:
import xgboost as xgb
import pandas as pd
import numpy as np
# Training data as DataFrame (has column names)
X_train = pd.DataFrame({'age': [25, 30], 'salary': [50000, 60000], 'score': [80, 90]})
y_train = [0, 1]
model = xgb.XGBClassifier()
model.fit(X_train, y_train)
# WRONG — columns in different order
X_test = pd.DataFrame({'salary': [55000], 'age': [28], 'score': [85]})
model.predict(X_test) # ValueError: feature_names mismatch
# CORRECT — reorder columns to match training data
X_test = X_test[X_train.columns]
model.predict(X_test) # Works
# WRONG — NumPy array has no column names but model expects them
X_test_array = np.array([[28, 55000, 85]])
model.predict(X_test_array) # May warn or error depending on version
# CORRECT — convert to DataFrame with matching columns
X_test_df = pd.DataFrame(X_test_array, columns=X_train.columns)
model.predict(X_test_df)
When loading a saved model, you must know the original feature names:
import xgboost as xgb
import json
# Save model (includes feature names in metadata)
model.save_model('model.json')
# Load and check stored feature names
loaded = xgb.XGBClassifier()
loaded.load_model('model.json')
print(loaded.get_booster().feature_names) # ['age', 'salary', 'score']
# Now ensure prediction data matches these names
Disable feature name validation (not recommended, but useful for debugging):
import xgboost as xgb
model = xgb.XGBClassifier()
model.fit(X_train, y_train)
# Predict using DMatrix directly — bypasses feature name check
dmatrix = xgb.DMatrix(X_test.values) # .values strips column names
model.get_booster().predict(dmatrix)
Fix 2: GPU Training Errors
XGBoostError: [13:45:20] ../src/common/device_helpers.cuh:431:
Check failed: e == cudaSuccess (100 vs. 0) : no CUDA-capable device is detected
XGBoostError: GPU support is not available for current XGBoost installation.
Step 1: Confirm your XGBoost build includes GPU support.
On Linux and Windows, the official PyPI wheel already bundles CUDA support; a source build, a very old release, or the CPU-only variant does not:
# Standard wheel (includes GPU support on Linux and Windows)
pip install xgboost
# CPU-only variant (smaller download, no GPU)
pip install xgboost-cpu
# Conda build with GPU support
conda install -c conda-forge py-xgboost-gpu
Step 2: Verify CUDA is available:
nvidia-smi # Check GPU is detected
nvcc --version # Check CUDA compiler version
Step 3: Set the correct device parameter:
import xgboost as xgb
# XGBoost 2.0+ — use 'device' parameter
model = xgb.XGBClassifier(
device='cuda', # Use GPU
n_estimators=100,
max_depth=6,
learning_rate=0.1,
)
model.fit(X_train, y_train)
# XGBoost 1.x — use 'tree_method' and 'gpu_id'
model = xgb.XGBClassifier(
tree_method='gpu_hist', # Deprecated in 2.0
gpu_id=0, # Deprecated in 2.0
n_estimators=100,
)
# Native API — DMatrix + params
dtrain = xgb.DMatrix(X_train, label=y_train)
params = {
'device': 'cuda', # XGBoost 2.0+
'objective': 'binary:logistic',
'max_depth': 6,
'eta': 0.1,
}
model = xgb.train(params, dtrain, num_boost_round=100)
Multi-GPU training (XGBoost 2.0+):
model = xgb.XGBClassifier(
device='cuda',
n_estimators=1000,
# XGBoost auto-detects multiple GPUs for data parallel training
)
Fall back to CPU gracefully:
import xgboost as xgb
try:
model = xgb.XGBClassifier(device='cuda', n_estimators=100)
model.fit(X_train, y_train)
except xgb.core.XGBoostError:
print("GPU not available, falling back to CPU")
model = xgb.XGBClassifier(device='cpu', n_estimators=100)
    model.fit(X_train, y_train)
For CUDA setup and GPU detection patterns across ML frameworks, see PyTorch not working and TensorFlow not working.
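If the same script must run under both 1.x and 2.x, you can pick the parameter spelling at runtime from the installed version. A small sketch; gpu_params is our helper, not part of XGBoost:

```python
def gpu_params(version: str) -> dict:
    """Pick the GPU parameter spelling for a given XGBoost version string."""
    major = int(version.split(".")[0])
    if major >= 2:
        return {"device": "cuda"}                    # 2.x spelling
    return {"tree_method": "gpu_hist", "gpu_id": 0}  # 1.x spelling

# Usage: xgb.XGBClassifier(n_estimators=100, **gpu_params(xgb.__version__))
print(gpu_params("2.1.1"))  # {'device': 'cuda'}
print(gpu_params("1.7.6"))  # {'tree_method': 'gpu_hist', 'gpu_id': 0}
```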
Fix 3: Deprecated Parameters — use_label_encoder, silent, gpu_hist
WARNING: use_label_encoder is deprecated in 1.6 and will be removed in a future release.
WARNING: Parameters: { "silent" } are not used.
XGBoost has deprecated or removed several parameters across versions. Remove them to suppress warnings and avoid future breakage.
Full deprecation table:
| Deprecated | Removed in | Replacement |
|---|---|---|
| use_label_encoder=False | 2.0 | Removed entirely — XGBoost no longer uses a label encoder |
| silent=True | 1.6 | verbosity=0 |
| tree_method='gpu_hist' | 2.0 | device='cuda' |
| gpu_id=0 | 2.0 | device='cuda:0' |
| predictor='gpu_predictor' | 2.0 | device='cuda' |
| n_jobs (native API) | — | sklearn API only; the native API uses nthread |
| eval_metric passed to fit() | 2.0 | Pass it to the constructor, e.g. eval_metric=['logloss', 'auc'] |
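The table can be applied mechanically if you maintain parameter dicts for several XGBoost versions. A sketch of a 1.x-to-2.x migration; migrate_params is our helper, not an XGBoost function:

```python
REMOVED = {"use_label_encoder", "silent", "gpu_id", "predictor"}

def migrate_params(params: dict) -> dict:
    """Translate a 1.x parameter dict to 2.x: drop removed keys, rewrite GPU spellings."""
    out = {k: v for k, v in params.items() if k not in REMOVED}
    if out.get("tree_method") == "gpu_hist":
        out["tree_method"] = "hist"  # 2.x separates the algorithm from the device
        out["device"] = "cuda"
    if params.get("silent"):
        out["verbosity"] = 0         # silent=True became verbosity=0
    return out

print(migrate_params({"silent": True, "tree_method": "gpu_hist", "max_depth": 6}))
```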
Clean XGBoost 2.0+ constructor:
import xgboost as xgb
# WRONG — full of deprecated params
model = xgb.XGBClassifier(
use_label_encoder=False, # Removed in 2.0
silent=True, # Removed in 1.6
tree_method='gpu_hist', # Deprecated in 2.0
gpu_id=0, # Deprecated in 2.0
eval_metric='logloss',
n_estimators=100,
)
# CORRECT — XGBoost 2.0+
model = xgb.XGBClassifier(
device='cuda', # Replaces tree_method/gpu_id
verbosity=0, # Replaces silent
eval_metric='logloss',
n_estimators=100,
max_depth=6,
learning_rate=0.1,
)Check your XGBoost version:
import xgboost
print(xgboost.__version__) # 2.x.x or 1.x.x
Fix 4: Early Stopping Changed in XGBoost 2.0
TypeError: fit() got an unexpected keyword argument 'early_stopping_rounds'
In XGBoost 2.0, early_stopping_rounds was removed from the fit() method of XGBClassifier/XGBRegressor and moved to the constructor (or a callback).
XGBoost 1.x pattern (broken in 2.0):
# WRONG in 2.0 — early_stopping_rounds is no longer a fit() argument
model = xgb.XGBClassifier(n_estimators=1000)
model.fit(
    X_train, y_train,
    eval_set=[(X_val, y_val)],
    early_stopping_rounds=50, # TypeError in 2.0
)
XGBoost 2.0+ pattern — pass it to the constructor:
import xgboost as xgb
model = xgb.XGBClassifier(
    n_estimators=1000,
    learning_rate=0.05,
    max_depth=6,
    eval_metric='logloss',
    early_stopping_rounds=50, # Constructor parameter in 2.0+
)
model.fit(
    X_train, y_train,
    eval_set=[(X_val, y_val)],
    verbose=True, # Print eval metric each round
)
# Or set it after construction via set_params
model.set_params(early_stopping_rounds=50)
model.fit(
    X_train, y_train,
    eval_set=[(X_val, y_val)],
    verbose=50, # Print every 50 rounds
)
Or use the callback API directly:
import xgboost as xgb
from xgboost.callback import EarlyStopping
model = xgb.XGBClassifier(
n_estimators=1000,
learning_rate=0.05,
callbacks=[EarlyStopping(rounds=50, save_best=True)],
)
model.fit(
X_train, y_train,
eval_set=[(X_val, y_val)],
)
# model.best_iteration — the round where training stopped
print(f"Best iteration: {model.best_iteration}")
print(f"Best score: {model.best_score}")
Pro Tip: Always set n_estimators high (1000–5000) and rely on early stopping to find the right number. This is faster than grid-searching n_estimators directly, because each training run stops at the optimal point instead of running to completion.
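The rule behind early stopping is simple: track the best validation score and stop once it has not improved for a fixed number of consecutive rounds. A toy sketch of that logic, not XGBoost's actual implementation:

```python
def best_round(eval_scores, rounds):
    """Index of the best score, stopping once `rounds` rounds pass without improvement.

    Assumes lower is better (e.g. logloss).
    """
    best, best_i = float("inf"), 0
    for i, score in enumerate(eval_scores):
        if score < best:
            best, best_i = score, i
        elif i - best_i >= rounds:
            break  # patience exhausted: stop boosting
    return best_i

# Validation logloss per round: improves, then plateaus
print(best_round([0.90, 0.70, 0.60, 0.61, 0.62, 0.63], rounds=2))  # 2
```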
eval_set is required for early stopping — without it, XGBoost has no validation metric to monitor:
# WRONG — no eval_set, early stopping has nothing to monitor
model.fit(X_train, y_train)
# No error, but early stopping silently does nothing
# CORRECT
model.fit(
X_train, y_train,
eval_set=[(X_val, y_val)], # Required for early stopping
)
Fix 5: DMatrix and Data Format Issues
ValueError: DataFrame.dtypes for data must be int, float, bool or category.
XGBoost’s DMatrix (the native data structure) only accepts numeric types. String columns, object columns, and unsupported dtypes cause errors.
import xgboost as xgb
import pandas as pd
# WRONG — string column can't go into DMatrix
df = pd.DataFrame({'age': [25, 30], 'city': ['NYC', 'LA']})
dtrain = xgb.DMatrix(df) # ValueError
# CORRECT — encode categoricals before creating DMatrix
df['city'] = df['city'].astype('category')
dtrain = xgb.DMatrix(df, enable_categorical=True) # XGBoost 1.6+
# Or one-hot encode first
df_encoded = pd.get_dummies(df, columns=['city'])
dtrain = xgb.DMatrix(df_encoded)
enable_categorical=True (XGBoost 1.6+) lets XGBoost handle Pandas category dtype natively — no one-hot encoding needed. This is faster and uses less memory for high-cardinality features:
import xgboost as xgb
import pandas as pd
# Convert string columns to category dtype
df['city'] = df['city'].astype('category')
df['country'] = df['country'].astype('category')
# XGBoost handles categories natively
model = xgb.XGBClassifier(
enable_categorical=True,
tree_method='hist', # Required for categorical support
)
model.fit(df[features], y)
Missing values — XGBoost handles NaN natively in DMatrix. Don’t impute before training unless you have a specific reason:
import xgboost as xgb
import numpy as np
# NaN values are handled automatically — XGBoost learns optimal split direction
X = np.array([[1.0, np.nan], [2.0, 3.0], [np.nan, 4.0]])
y = [0, 1, 0]
dtrain = xgb.DMatrix(X, label=y) # Works — NaN is a valid input
model = xgb.train({'max_depth': 3}, dtrain)
This is a major advantage over scikit-learn, where most estimators reject NaN input. For sklearn imputation patterns when preprocessing data before passing to non-XGBoost models, see scikit-learn not working.
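The mechanism is that each split learns a default direction during training, and missing values simply follow it at prediction time. A toy illustration of routing at one split; this is our simplification, not the real tree code:

```python
def route(value, threshold, missing_left):
    """Route a value at one tree split; NaN follows the learned default direction."""
    if value != value:  # NaN is the only float not equal to itself
        return "left" if missing_left else "right"
    return "left" if value < threshold else "right"

print(route(25.0, 30.0, missing_left=False))          # left
print(route(float("nan"), 30.0, missing_left=False))  # right
```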
Fix 6: Hyperparameter Tuning Mistakes
Parameters that don’t do what you think:
import xgboost as xgb
model = xgb.XGBClassifier(
n_estimators=100, # Number of boosting rounds (trees)
max_depth=6, # Max tree depth — higher = more overfitting
learning_rate=0.1, # Shrinkage factor — lower = need more trees
min_child_weight=1, # Min samples in a leaf — higher = more conservative
subsample=0.8, # Row sampling per tree — prevents overfitting
colsample_bytree=0.8, # Column sampling per tree — prevents overfitting
reg_alpha=0, # L1 regularization (lasso)
reg_lambda=1, # L2 regularization (ridge)
gamma=0, # Min loss reduction for split — higher = more conservative
scale_pos_weight=1, # Class imbalance: set to sum(negative) / sum(positive)
)
scale_pos_weight for imbalanced classes:
import numpy as np
# Calculate from data
neg_count = np.sum(y_train == 0)
pos_count = np.sum(y_train == 1)
scale = neg_count / pos_count
model = xgb.XGBClassifier(
scale_pos_weight=scale, # Balances the loss for minority class
eval_metric='auc', # AUC is better than accuracy for imbalanced data
)
Common Mistake: Setting learning_rate=0.3 (the default) with n_estimators=10000. A high learning rate with many trees overfits aggressively. Use the rule of thumb: lower learning rate (0.01–0.1) with more trees and early stopping to find the optimal number.
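The trade-off is easy to see on a deliberately simplified model: each round moves the prediction a fraction learning_rate of the remaining residual, so a smaller rate needs proportionally more rounds to reach the same fit. This toy is ours, not real gradient boosting:

```python
def rounds_to_fit(target, learning_rate, tol=1e-3):
    """Rounds needed for toy boosting (pred += lr * residual) to get within tol of target."""
    pred, rounds = 0.0, 0
    while abs(target - pred) > tol:
        pred += learning_rate * (target - pred)
        rounds += 1
    return rounds

print(rounds_to_fit(1.0, 0.3))   # 20 rounds
print(rounds_to_fit(1.0, 0.05))  # 135 rounds
```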
Feature importance — which method to use:
import xgboost as xgb
model = xgb.XGBClassifier()
model.fit(X_train, y_train)
# Built-in importance (based on split gain by default)
importance = model.get_booster().get_score(importance_type='gain')
# Options: 'weight' (split count), 'gain' (avg gain), 'cover' (avg samples)
# Or use the sklearn-compatible property
print(model.feature_importances_) # Array of importance scores
# Plot
xgb.plot_importance(model, importance_type='gain', max_num_features=15)
Fix 7: Model Saving and Loading
Save formats:
import xgboost as xgb
model = xgb.XGBClassifier()
model.fit(X_train, y_train)
# JSON format (recommended — human-readable, portable)
model.save_model('model.json')
loaded = xgb.XGBClassifier()
loaded.load_model('model.json')
# Binary format (smaller file, faster load)
model.save_model('model.ubj') # Universal Binary JSON
# Pickle (works but not recommended for cross-version compatibility)
import pickle
with open('model.pkl', 'wb') as f:
    pickle.dump(model, f)
Cross-version compatibility — models saved in XGBoost 1.x can be loaded in 2.x, but not vice versa. Always save in JSON format for maximum portability:
# Best practice: save model + version info
import json
import xgboost as xgb
model.save_model('model.json')
metadata = {
'xgboost_version': xgb.__version__,
'feature_names': list(X_train.columns),
'target_name': 'is_fraud',
'training_date': '2025-03-15',
}
with open('model_metadata.json', 'w') as f:
    json.dump(metadata, f)
Loading in a sklearn Pipeline:
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
import xgboost as xgb
import joblib
pipeline = Pipeline([
('scaler', StandardScaler()),
('model', xgb.XGBClassifier(n_estimators=100)),
])
pipeline.fit(X_train, y_train)
# Save entire pipeline (includes scaler + model)
joblib.dump(pipeline, 'pipeline.joblib')
loaded_pipeline = joblib.load('pipeline.joblib')
Fix 8: Multi-Class and Multi-Output
Multi-class classification — XGBoost handles it natively with objective='multi:softmax' or multi:softprob:
import xgboost as xgb
# For class labels 0, 1, 2, 3
model = xgb.XGBClassifier(
objective='multi:softmax', # Returns class label (0, 1, 2, ...)
num_class=4,
n_estimators=100,
)
model.fit(X_train, y_train)
predictions = model.predict(X_test) # [0, 2, 1, 3, ...]
# For probability distributions
model = xgb.XGBClassifier(
objective='multi:softprob', # Returns probability per class
num_class=4,
)
model.fit(X_train, y_train)
probas = model.predict_proba(X_test) # shape (n_samples, 4)
Label encoding — XGBoost requires integer labels starting from 0:
from sklearn.preprocessing import LabelEncoder
le = LabelEncoder()
y_encoded = le.fit_transform(y_train) # ['cat', 'dog', 'fish'] → [0, 1, 2]
model.fit(X_train, y_encoded)
predictions = model.predict(X_test)
labels = le.inverse_transform(predictions) # [0, 1, 2] → ['cat', 'dog', 'fish']
Still Not Working?
XGBoost vs LightGBM vs CatBoost
If XGBoost is slow on your dataset, consider alternatives:
- LightGBM — faster training on large datasets via leaf-wise growth (vs. XGBoost’s level-wise). Better for high-cardinality categoricals.
- CatBoost — best native categorical support (no encoding needed), slower training but often competitive accuracy.
- XGBoost — most mature, broadest hardware support (GPU, distributed), largest community.
All three are gradient boosting frameworks with similar accuracy. The choice usually comes down to speed and feature handling requirements.
Integration with scikit-learn Pipelines
XGBoost’s sklearn API (XGBClassifier, XGBRegressor) is fully compatible with scikit-learn’s Pipeline, GridSearchCV, and cross_val_score:
from sklearn.model_selection import GridSearchCV
import xgboost as xgb
param_grid = {
'max_depth': [3, 5, 7],
'learning_rate': [0.01, 0.05, 0.1],
'n_estimators': [100, 500],
'subsample': [0.8, 1.0],
}
search = GridSearchCV(
xgb.XGBClassifier(eval_metric='logloss'),
param_grid,
cv=5,
scoring='roc_auc',
n_jobs=-1,
)
search.fit(X_train, y_train)
print(search.best_params_)
For sklearn Pipeline patterns and data leakage prevention during cross-validation, see scikit-learn not working.
Installation Fails on ARM or Unusual Platforms
# Pre-built wheels cover most platforms
pip install xgboost
# If pip builds from source and fails
pip install xgboost --no-build-isolation
# With conda (includes all native dependencies)
conda install -c conda-forge xgboost
For general build failures during pip install, see pip could not build wheels.
NumPy Array Shape Issues
XGBoost expects 2D arrays (n_samples, n_features). A 1D array triggers a reshape error. For NumPy broadcasting and reshaping patterns, see NumPy not working.
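A single sample needs an explicit second dimension before it can be passed to predict. A quick NumPy sketch; the values mirror the earlier feature-name examples:

```python
import numpy as np

x = np.array([28, 55000, 85])  # 1D: shape (3,), which reads as 3 samples of 1 feature
x2d = x.reshape(1, -1)         # 2D: shape (1, 3), one sample with three features
print(x2d.shape)  # (1, 3)

# If instead each value were its own single-feature sample:
col = x.reshape(-1, 1)         # shape (3, 1)
```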
Solo developer based in Japan. Every solution is cross-referenced with official documentation and tested before publishing.
Related Articles
Fix: LightGBM Not Working — Installation Errors, Categorical Features, and Training Issues
How to fix LightGBM errors — ImportError libomp libgomp not found, do not support special JSON characters in feature name, categorical feature index out of range, num_leaves vs max_depth overfitting, early stopping callback changes, and GPU build errors.
Fix: Jupyter Notebook Not Working — Kernel Dead, Module Not Found, and Widget Errors
How to fix Jupyter errors — kernel fails to start or dies, ModuleNotFoundError despite pip install, matplotlib plots not showing, ipywidgets not rendering in JupyterLab, port already in use, and jupyter command not found.
Fix: NumPy Not Working — Broadcasting Error, dtype Mismatch, and Array Shape Problems
How to fix NumPy errors — ValueError operands could not be broadcast together, setting an array element with a sequence, integer overflow, axis confusion, view vs copy bugs, NaN handling, and NumPy 1.24+ removed type aliases.
Fix: scikit-learn Not Working — NotFittedError, NaN Input, Pipeline, and ConvergenceWarning
How to fix scikit-learn errors — NotFittedError call fit before predict, ValueError Input contains NaN, could not convert string to float, Pipeline ColumnTransformer mistakes, cross-validation leakage, n_jobs hanging on Windows, and ConvergenceWarning.