Distributed Inference and Fine-tuning of Large Language Mode